Depth Estimation Based on Single-Lens Point Spread Function Regulation


Abstract: One approach to depth estimation is to make the point spread function (PSF) of the imaging optical system depth-dependent, so that the PSFs at different depths exhibit different spatial or spectral structures. The imaging process encodes the depth information carried by the PSF into the sensor image, and an algorithm then recovers the scene depth. We propose an imaging system design method that implements this approach with a single-lens imaging system. By building a differentiable imaging simulation model for depth scenes and designing a constraint method, we regulate the four-dimensional PSF of the single-lens optical system, which varies with wavelength, depth, and field of view, and we design a neural network to learn the depth information contained in the sensor-encoded image. The result is a monocular depth estimation model with a minimalist optical system, intended to replace existing complex optical systems that contain lens groups and phase-modulation optical elements. Trained and tested on the NYU Depth V2 dataset over a depth range of 1 to 5 m, the model reaches a mean relative error of depth estimation as low as 8.41%. The design has some application potential in low-cost, miniaturized depth imaging and detection.

Objective: Depth awareness is important for many computer vision tasks, such as autonomous driving and 3D reconstruction. Existing depth estimation methods include structured light, time of flight, stereo-based depth estimation, and monocular depth estimation. Monocular depth estimation has significant advantages in power consumption, cost, and size. There are two ways to achieve it. One is to use a neural network to learn the depth cues contained in the scene itself, such as texture gradients, shading, occlusions, and the type and size of objects, but this approach lacks interpretability. The other is depth from defocus (DFD) or depth estimation based on a coded aperture, which estimates depth by identifying depth-related optical features of the optical system. This approach often requires adding a phase mask to an existing camera and does not use the depth-encoding ability of the lens itself. We therefore propose a depth estimation method based on a single-lens optical system, which builds on existing single-lens depth estimation to further reduce the size and volume of the optical system while improving the accuracy of depth estimation.

Methods: A monocular depth estimation system usually consists of a camera and diffractive optical elements. The diffractive optical element is designed so that the point spread functions (PSFs) at different depths present different spatial or spectral structures. The optical system encodes scene depth information into 2D images, and a decoding algorithm then decodes the encoded image to estimate the depth map of the scene. A single-lens optical system can also present different spatial or spectral structures for objects at different depths. For this specific single-lens imaging system, we propose a simulation model for the four-dimensional PSF, which varies with wavelength, depth, and field position, along with a differentiable optical imaging model. We then introduce an optimization constraint method for depth estimation tasks, which regulates the PSF in both the depth and field dimensions. For the depth estimation algorithm, we take into account that methods relying on the different spatial or spectral structures of PSFs depend strongly on the texture of objects in the scene. Since the semantic information of the scene can compensate for this limitation, we propose a depth estimation network that includes a semantic information extraction preprocessing model. We connect the imaging model to the depth estimation algorithm and jointly design the single-lens optical system and the algorithm. Finally, we design a visible-light depth detector that comprises an aspherical single-lens optical system and the corresponding depth estimation algorithm.

Results and Discussions: To verify our method, we train and test it on the NYU Depth V2 dataset with the target depth range set from 1.0 m to 5.0 m. The initial single-lens optical system, with a focal length of 31.4 mm and a field of view of approximately 10°, is optimized together with its corresponding depth estimation algorithm. We compare our method with three alternative approaches: 1) the conventional depth-from-defocus model, which treats the lens as a thin lens; 2) a phase coded-aperture model implemented with a diffractive optical element of size 256×256; 3) a phase coded-aperture model whose phase mask consists of several concentric rings. Our single-lens depth estimation model achieves a relative error as low as 0.083 on the NYU Depth V2 dataset, the lowest among the compared methods. To further evaluate the contribution of the proposed method, we conduct ablation experiments. Specifically, we replace the optimized single-lens optical system with an unoptimized version, and we substitute a neural network without semantic preprocessing for the network with the semantic information extraction preprocessing step. Both modifications degrade depth estimation accuracy, which supports the effectiveness of our method in improving the depth estimation model.

Conclusions: We introduce an end-to-end single-lens depth estimation model. First, to accurately simulate the out-of-focus and off-axis aberrations of a real camera lens in a depth scene, we propose a differentiable imaging model. Then, we introduce a single-lens optimization constraint method that regulates the PSFs of the single-lens optical system to strengthen the depth dependence of its imaging response, so that the lens is optimized in the direction that maximizes the depth estimation performance of the model. We also propose a preprocessing method that incorporates semantic information to offset the decoding process's dependence on image texture. Finally, by jointly optimizing the single-lens optical system and the depth estimation algorithm, we realize a depth estimation model based on a minimalist optical system and carry out simulation and testing on the NYU Depth V2 dataset. The results show that the design method can greatly reduce the volume of the depth estimation system while maintaining high depth estimation performance. The method is relevant to applications such as distance sensors on unmanned aerial vehicle platforms.
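As a rough illustration of the thin-lens depth-from-defocus baseline named in the comparison, the sketch below computes how the defocus blur diameter on the sensor changes with object depth over the 1 to 5 m range. The 31.4 mm focal length comes from the abstract; the f-number (4.0), the focus distance (2 m), and the function name `blur_diameter_mm` are assumptions made only for this example and are not taken from the paper.

```python
import numpy as np

def blur_diameter_mm(depth_m, focus_m, focal_mm=31.4, f_number=4.0):
    """Thin-lens circle-of-confusion diameter on the sensor, in millimetres.

    focal_mm is the focal length quoted in the abstract; f_number and
    focus_m are assumed values chosen only for illustration.
    """
    s = np.asarray(depth_m, dtype=float) * 1000.0  # object distance in mm
    s_f = focus_m * 1000.0                         # in-focus object distance in mm
    aperture = focal_mm / f_number                 # aperture diameter in mm
    # Thin-lens geometry: c = A * f * |s - s_f| / (s * (s_f - f))
    return aperture * focal_mm * np.abs(s - s_f) / (s * (s_f - focal_mm))

depths = np.linspace(1.0, 5.0, 5)           # 1, 2, 3, 4, 5 m
blur = blur_diameter_mm(depths, focus_m=2.0)

for d, c in zip(depths, blur):
    print(f"depth {d:.1f} m -> blur diameter {c * 1000:.1f} um")
```

Under these assumed values the blur vanishes at the focus distance and grows on either side of it, so blur size alone cannot tell whether an object lies in front of or behind the focal plane.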

Authors: Sun Zaiwu; Tan Fanjiao; Yu Pengliang; Li Zongling; Zhang Rongshuai; Yang Changjian; Hou Qingyu (Research Center for Space Optical Engineering, School of Astronautics, Harbin Institute of Technology, Harbin 150001, Heilongjiang, China; Harbin Xinguang Optic-Electronics Technology Co., Ltd., Harbin 150036, Heilongjiang, China)

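Affiliations: Research Center for Space Optical Engineering, School of Astronautics, Harbin Institute of Technology; Harbin Xinguang Optic-Electronics Technology Co., Ltd.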

Source: Acta Optica Sinica (Peking University Core Journal), 2025, No. 11, pp. 92-101 (10 pages)

Funding: National Natural Science Foundation of China (62375067).

Keywords: imaging system; computational imaging; single lens; imaging simulation; monocular depth estimation; point spread function

Classification number: O436 [Mechanical Engineering: Optical Engineering]

Corresponding author: Hou Qingyu, houqingyu@126.com.


