Scene recognition combining structural and textural features (综合结构和纹理特征的场景识别)

Cited by: 4

Abstract: Automatic recognition of scene content is an important problem in computer vision. Although considerable progress has been made, the complexity of scenes means it remains highly challenging. Some previous scene recognition methods require the training images to be manually annotated with semantic labels in advance, and most are based on the "bag of visual words" (bag-of-features) model, which uses clustering to quantize numerous local region descriptors into a codebook. The clustering step is costly in both computation and memory, and the codebook size (number of clusters) and the choice of initial cluster centers greatly affect recognition performance. To overcome these weaknesses, we present an unsupervised natural scene recognition approach that is not based on the "bag of visual words" model. The approach first constructs multiple images of different resolutions by subsampling and then extracts structural and textural features at each resolution level. The structural features are represented by the weighted histograms of gradient orientation descriptor proposed in this paper, and the textural features are represented by the responses of a Gabor filter bank and a Schmid filter set. We treat the structural and textural features as two independent feature channels and combine them to categorize scenes automatically with a support vector machine (SVM). We evaluated the approach on three commonly used scene datasets: the 8-, 13-, and 15-category datasets of Oliva, Li Fei-Fei, and Lazebnik et al. The experiments show that the weighted histograms of gradient orientation descriptor outperforms the classical scale-invariant feature transform (SIFT) descriptor in natural scene recognition, and that the combined structural and textural approach achieves good performance relative to current state-of-the-art methods on all three datasets.
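The structural channel described in the abstract (subsample to several resolutions, then describe each level with a gradient orientation histogram) can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not give the descriptor's weighting scheme, bin count, or number of resolution levels, so the gradient-magnitude weighting, `n_bins=8`, `levels=3`, and all function names here are hypothetical. The Gabor/Schmid texture channel and the SVM stage are omitted, and only the standard library is used.

```python
import math

def subsample(image):
    """Keep every second row and column (plain subsampling, no smoothing)."""
    return [row[::2] for row in image[::2]]

def orientation_histogram(image, n_bins=8):
    """Gradient orientation histogram weighted by gradient magnitude.

    The magnitude weighting is an assumption; the abstract does not
    specify how the paper's descriptor weights its bins.
    """
    hist = [0.0] * n_bins
    for y in range(1, len(image) - 1):
        for x in range(1, len(image[0]) - 1):
            # Central differences on interior pixels.
            gx = (image[y][x + 1] - image[y][x - 1]) / 2.0
            gy = (image[y + 1][x] - image[y - 1][x]) / 2.0
            magnitude = math.hypot(gx, gy)
            angle = math.atan2(gy, gx) % (2 * math.pi)  # map to [0, 2*pi)
            bin_index = min(int(angle / (2 * math.pi) * n_bins), n_bins - 1)
            hist[bin_index] += magnitude
    total = sum(hist)
    # L1-normalize so each level contributes on the same scale.
    return [h / total for h in hist] if total > 0 else hist

def structural_feature(image, levels=3, n_bins=8):
    """Concatenate the histograms computed at each resolution level."""
    feature = []
    for _ in range(levels):
        feature.extend(orientation_histogram(image, n_bins))
        image = subsample(image)
    return feature
```

For a grayscale image given as a list of rows, `structural_feature(image)` returns a vector of `levels * n_bins` values. A texture vector built from filter-bank responses could be kept as a separate channel and the two passed to an SVM, in line with the two-channel design the abstract describes.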

Authors: ZHOU Li, HU DeWen, ZHOU ZongTan

Affiliation: College of Mechatronic Engineering and Automation, National University of Defense Technology

Source: Scientia Sinica (Informationis) (《中国科学：信息科学》), CSCD-indexed, 2012, No. 6, pp. 687-702 (16 pages)

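Funding: National Natural Science Foundation of China (Grant Nos. 60835005, 60736018); National Basic Research Program of China (Grant No. 2007CB311001); Hunan Provincial University Science and Technology Innovation Team funding project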

Keywords: scene recognition, structural feature, textural feature, feature combination, weighted histograms of gradient orientation descriptor

Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]

About the authors

ZHOU Li (corresponding author; e-mail: li-zhou@yahoo.cn) was born in 1982. He received the B.S. and M.S. degrees from Dalian Navy Academy in 2004 and 2006, respectively. He is currently working toward a doctoral degree at National University of Defense Technology. His research interests include computer/biological vision, visual navigation, and machine learning.

HU DeWen (e-mail: dwhu@nudt.edu.cn) was born in 1963. He received the B.S. and M.S. degrees from Xi'an Jiaotong University in 1983 and 1986, respectively. He has been with National University of Defense Technology since 1986. From October 1995 to October 1996, he was a Visiting Scholar at the University of Sheffield, UK. He was promoted to Professor in 1996 and received his Ph.D. degree from National University of Defense Technology in 1999. His research interests include image processing, system identification and control, neural networks, and cognitive science. He is an action editor of Neural Networks.

ZHOU ZongTan (e-mail: narcz@163.com) was born in 1969. He received the B.S., M.S., and Ph.D. degrees from National University of Defense Technology in 1990, 1994, and 1998, respectively. He was promoted to Professor in 2007. From February 2010 to February 2011, he was a Visiting Scholar at the Eberhard Karls Universität Tübingen. His research interests include image/signal processing, computer/biological vision, neural networks, cognitive neuroscience, and brain-computer interfaces.


