
Illuminant estimation via deep residual learning
Abstract

Objective: Color constancy refers to the human ability to recognize an object as having a consistent color under varying illuminants, and it has become an important prerequisite for high-level tasks such as recognition, segmentation, and 3D vision. In the computer vision community, the goal of computational color constancy is to remove illuminant color casts and obtain accurate color representations of images. Illuminant estimation is therefore an important means of achieving computational color constancy, but it is a difficult, underdetermined problem because the observed image colors are influenced by unknown factors such as scene illuminants and object reflectances. Illuminant estimation methods fall into two classes: statistics-based (or static) methods and learning-based methods. Statistics-based methods estimate the illuminant from statistical properties of the image (e.g., reflectance distributions). Learning-based methods learn a model from training images and then use the model to estimate the illuminant. Convolutional neural networks (CNNs) are powerful illuminant estimators, and many competitive results have been obtained with CNN-based methods. Existing methods, however, often suffer large estimation errors caused by ambiguous colors in local scene regions. In this study we propose a CNN-based illuminant estimation algorithm that uses deep residual learning to improve network accuracy and a patch-selection network to overcome the color-ambiguity problem of local patches.
Method: We uniformly sample local patches from the image, estimate the local illuminant of each patch individually, and produce a global illuminant estimate for the entire image by combining the local estimates. We use a 64×64 patch size, which guarantees the estimation accuracy of the local illuminant while providing sufficient training inputs without data augmentation. The approach comprises two residual networks: an illuminant estimation net (IEN) and a patch selection net (PSN). The IEN estimates the local illuminant of each image patch. To improve its accuracy, we deepen the feature extraction hierarchy by adding network depth and use the residual structure to preserve gradient propagation and ease the training of the deep network. The IEN consists of many stacked 3×3 and 1×1 convolutional layers, batch normalization layers, and rectified linear unit (ReLU) layers, followed by one global average pooling layer and one fully connected layer. We optimize the IEN with a Euclidean loss and stochastic gradient descent (SGD). The PSN shares a similar architecture with the IEN, except that the PSN appends a Softmax layer that serves as the classifier at the end of the network. The PSN classifies image patches according to their illuminant estimation errors and is optimized with a cross-entropy loss and SGD. Based on the PSN results, patches with large estimation errors are removed from the image, which improves the performance of the global illuminant estimation. In addition, we preprocess the input image with a log-chrominance transform that converts the three-channel RGB image into a two-channel log-chrominance image; this reduces the influence of image luminance and improves computational efficiency by cutting the amount of data by one third. Result: We implement the proposed IEN and PSN on the Caffe library.
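The log-chrominance preprocessing described above can be sketched as follows. This is a minimal NumPy version; the abstract does not give the exact channel definitions, so the u/v mapping below follows the common log-chroma convention (normalizing by the green channel) and is an assumption, not the authors' exact formula:

```python
import numpy as np

def rgb_to_log_chrominance(img, eps=1e-6):
    """Convert an HxWx3 RGB image into an HxWx2 log-chrominance image.

    Dividing by the green channel cancels overall luminance, so the two
    remaining channels depend mostly on chromaticity, and the data volume
    drops by one third, as the abstract notes.
    """
    img = img.astype(np.float64) + eps   # avoid log(0) on dark pixels
    r, g, b = img[..., 0], img[..., 1], img[..., 2]
    u = np.log(g / r)                    # assumed channel definition
    v = np.log(g / b)                    # assumed channel definition
    return np.stack([u, v], axis=-1)
```

For a perfectly gray image the ratios g/r and g/b are 1, so both log-chrominance channels are zero, which illustrates why the transform suppresses brightness while keeping color information.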
To evaluate the performance of our approach, we use two standard single-illuminant datasets: the NUS-8 dataset and the reprocessed ColorChecker dataset. Both include indoor and outdoor images, and a Macbeth ColorChecker is placed in each image to compute the ground-truth illuminant. The NUS-8 dataset contains 1736 images captured by 8 different cameras, and the reprocessed ColorChecker dataset consists of 568 images from 2 cameras. Following the configurations of previous studies, we report the mean, the median, the tri-mean, and the means of the lowest 25% and the highest 25% of the angular errors; for the reprocessed ColorChecker dataset, we additionally report the 95th percentile. We divide the NUS-8 dataset into eight camera-specific subsets, apply three-fold cross-validation to each subset individually, and report the geometric mean of the metrics over all eight subsets. We apply three-fold cross-validation directly to the reprocessed ColorChecker dataset. Experimental results show that the proposed approach is competitive with state-of-the-art methods: on the NUS-8 dataset, the proposed IEN achieves the best results among all compared methods, and the PSN further increases the precision of the IEN results; on the reprocessed ColorChecker dataset, our results are comparable with those of other advanced methods. We also conduct ablation studies on the model components. Comparing the proposed IEN with several shallower CNNs shows that deep residual learning effectively improves illuminant estimation accuracy. Moreover, compared with estimation on the original image, log-chrominance preprocessing reduces the illuminant estimation error by 10% to 15%, and the PSN further decreases the global illuminant estimation error by about 5% compared with using the IEN alone.
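The angular error and the summary statistics reported above (mean, median, tri-mean, best/worst 25%) are standard in color-constancy evaluation and can be computed as in this sketch (the `summarize` helper and its field names are illustrative, not the authors' code):

```python
import numpy as np

def angular_error_deg(est, gt):
    """Angle in degrees between estimated and ground-truth illuminant vectors."""
    est = np.asarray(est, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    cos = np.dot(est, gt) / (np.linalg.norm(est) * np.linalg.norm(gt))
    return np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))  # clip guards rounding

def summarize(errors):
    """Standard color-constancy statistics over a set of angular errors."""
    e = np.sort(np.asarray(errors, dtype=np.float64))
    n = len(e)
    q1, q2, q3 = np.percentile(e, [25, 50, 75])
    return {
        "mean": e.mean(),
        "median": q2,
        "trimean": (q1 + 2 * q2 + q3) / 4.0,   # Tukey's tri-mean
        "best25": e[: max(1, n // 4)].mean(),   # mean of lowest 25%
        "worst25": e[-max(1, n // 4):].mean(),  # mean of highest 25%
    }
```

Because the angular error compares only the directions of the RGB illuminant vectors, it is invariant to overall brightness, which is why it is the accepted metric for illuminant estimation.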
Finally, we evaluate the running time of our method on a PC with an Intel i5 2.7 GHz CPU, 16 GB of memory, and an NVIDIA GeForce GTX 1080 Ti GPU. Our code takes less than 1.4 s to process a 2K image (a typical resolution of 2048×1080 pixels). Conclusion: Experiments on the two single-illuminant datasets show that the proposed approach, comprising log-chrominance preprocessing, a deep residual network structure, and patch selection for global illuminant estimation, is reasonable and effective. The approach achieves high precision and robustness and can be widely applied in image processing and computer vision systems that require color calibration.
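Putting the pieces together, the patch-based global estimation summarized in the conclusion (uniform patch sampling, per-patch estimation, patch selection, aggregation) could look like the sketch below. Here `ien` and `psn` stand in for the trained networks; their call interfaces, the mean aggregation, and the fallback path are assumptions for illustration, not the authors' API:

```python
import numpy as np

def estimate_global_illuminant(image, ien, psn, patch=64):
    """Patch-based global illuminant estimation (illustrative sketch).

    image : HxWxC array (e.g., the 2-channel log-chrominance image)
    ien   : callable, patch -> 3-vector local illuminant (assumed interface)
    psn   : callable, patch -> bool, True if the patch is reliable (assumed)
    """
    h, w = image.shape[:2]
    grid = [(y, x) for y in range(0, h - patch + 1, patch)
                   for x in range(0, w - patch + 1, patch)]  # uniform grid
    estimates = [ien(image[y:y + patch, x:x + patch])
                 for (y, x) in grid
                 if psn(image[y:y + patch, x:x + patch])]    # drop ambiguous patches
    if not estimates:                                        # fallback: keep all
        estimates = [ien(image[y:y + patch, x:x + patch]) for (y, x) in grid]
    v = np.mean(estimates, axis=0)                           # combine local estimates
    return v / np.linalg.norm(v)                             # direction only
```

Normalizing the final vector reflects the fact that only the illuminant's chromaticity direction matters for color correction, consistent with the angular-error metric used in the evaluation.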
Authors: Cui Shuai (崔帅), Zhang Jun (张骏), Gao Jun (高隽), School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
Source: Journal of Image and Graphics (《中国图象图形学报》), 2019, No. 12, pp. 2111-2125 (15 pages). Indexed in CSCD and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61876057, 61403116)
Keywords: visual optics; color constancy; illuminant estimation; deep residual learning; log-chrominance
About the authors: Cui Shuai, born in 1986, male, Ph.D. candidate; research interests: artificial intelligence and robotics. E-mail: baalme@163.com. Corresponding author: Zhang Jun, female, associate researcher; research interests: computer vision, image processing, and machine learning. E-mail: zhangjun@hfut.edu.cn. Gao Jun, male, professor and doctoral supervisor; research interests: image processing, pattern recognition, neural network theory and applications, optoelectronic information processing, and intelligent information processing. E-mail: gaojun@hfut.edu.cn.