期刊文献+

综合结构和纹理特征的场景识别 被引量:4

Scene recognition combining structural and textural features
原文传递
导出
摘要 当前在计算机视觉领域,场景识别尽管取得了较大进展,但其对于计算机视觉而言,仍然是一个极具挑战的问题.此前的场景识别方法,有些需要预先手动地对训练图像进行语义标注,并且大部分场景识别方法均基于"特征袋"模型,需要对提取的大量特征进行聚类,计算量和内存消耗均很大,且初始聚类中心及聚类数目的选择对识别效果有较大影响.为此本文提出一种不基于"特征袋"模型的无监督场景识别方法.先通过亚采样构建多幅不同分辨率的图像,在多级分辨率图像上,分别提取结构和纹理特征,用本文提出的梯度方向直方图描述方法表示图像的结构特征,用Gabor滤波器组和Schmid滤波集对图像的滤波响应表示图像的纹理特征,并将结构和纹理特征作为相互独立的两个特征通道,最后综合这两个特征通道,通过SVM分类,实现对场景的自动识别.分别在Oliva,Li Fei-Fei和Lazebnik等的8类、13类和15类场景图像库上进行测试实验,实验结果表明,梯度方向直方图描述方法比经典的SIFT描述方法,有着更好的场景识别性能;综合结构和纹理特征的场景识别方法,在通用的三个场景图像库上取得了很好的识别效果. Automatic recognition of the contents of a scene is an important issue in the field of computer vision. Although considerable progress has been made, the complexity of scenes remains an important challenge to computer vision research. Most previous approaches for scene recognition are based on the so-called "bag of visual words" model, which uses clustering methods to quantize numerous local region descriptors into a codebook. The size of the codebook and the selection of initial clustering centers greatly affect the performance. ~rthermore, the large size of the codebook leads to high computational costs and large memory consumption. To overcome these weaknesses, we present an unsupervised natural scene recognition approach that is not based on the "bag of visual words" model. This approach constructs multiple images of different resolutions and extracts structural and textural features from these images. The structural features are represented by weighted histograms of the gradient orientation descriptor, which is presented in this paper, and the textural features are represented by filter responses of Gabor filters and a Schmid set. We regard the structural and textural features as two independent feature channels, and combine them to realize automatic categorization of scenes using a support vector machine. We then evaluated our approach using three commonly used datasets with various scene categories. Our experiments demonstrate that the weighted histograms of the gradient orientation descriptor outperform the classical scale invariant feature transform descriptor in natural-scene recognition, and our approach achieves good performance with respect to current state-of-the-art methods.
出处 《中国科学:信息科学》 CSCD 2012年第6期687-702,共16页 Scientia Sinica(Informationis)
基金 国家自然科学基金(批准号:60835005 60736018) 国家重点基础研究发展计划(批准号:2007CB311001) 湖南省高校科技创新团队资助项目
关键词 场景识别 结构特征 纹理特征 特征融合 梯度方向直方图 scene recognition, structural feature, textural feature, feature combination, weighted histograms of gradient orientation descriptor
作者简介 通信作者.E—mail:li—zhou@yahoo.cn, ZHOU Li was born in 1982. He received the B.S. and M.S. degrees from Dalian Navy Academy in 2004 and 2006, respectively. He is currently working toward the doctoral degree in National University of Defense Technology. His research interests include computer/biological vision, visual navigation, and machine learning. dwhu@nudt.edu.cn,HU DeWen was born in 1963. He received the B.S. and M.S. degrees from Xi'an Jiaotong University in 1983 and 1986, respectively. From 1986, he was with National University of Defense Technology. From October 1995 to October 1996, he was a Visiting Scholar with the University of Sheffield, UK. He got his Ph.D. degree from National University of Defense Technology in 1999.He was promoted Professor in 1996. His research interests include image processing, system identification and control, neural networks, and cognitive science. He is an action editor of neural networks. narcz@163.comZHOU ZongTan was born in 1969. He received the B.S., M.S. and Ph.D. degrees from National University of Defense Technology in 1990, 1994 and 1998, respectively. From February 2010 to February 2011, He was a Visiting Scholar with the Eberhard Karls Uni- versitt Tfibingen. Professor in 2007 He was promoted His research interests include image/signal processing, comouter/biologica.l vision, neural net-works, cognitive neuroscience and brain-computer interface.
  • 相关文献

参考文献18

  • 1CHEN Jing,WANG YongTian,GUO JunWei,LIU Wei,LIN JingDun,XUE Kang,LIU Yue,DING GangYi.Augmented reality registration algorithm based on nature feature recognition[J].Science China(Information Sciences),2010,53(8):1555-1565. 被引量:12
  • 2J. Zhang,M. Marsza?ek,S. Lazebnik,C. Schmid.Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study[J].International Journal of Computer Vision.2007(2)
  • 3Julia Vogel,Bernt Schiele.Semantic Modeling of Natural Scenes for Content-Based Image Retrieval[J].International Journal of Computer Vision.2007(2)
  • 4David G. Lowe.Distinctive Image Features from Scale-Invariant Keypoints[J].International Journal of Computer Vision.2004(2)
  • 5Krystian Mikolajczyk,Cordelia Schmid.Scale & Affine Invariant Interest Point Detectors[J].International Journal of Computer Vision.2004(1)
  • 6Pedro F. Felzenszwalb,Daniel P. Huttenlocher.Efficient Graph-Based Image Segmentation[J].International Journal of Computer Vision.2004(2)
  • 7Antonio Torralba.Contextual Priming for Object Detection[J].International Journal of Computer Vision.2003(2)
  • 8Aude Oliva,Antonio Torralba.Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope[J].International Journal of Computer Vision.2001(3)
  • 9Bosch A,Zisserman A,Muoz X.Image classification using random forests and ferns[].Proceedings of the th International Conference on Computer Vision.2007
  • 10Thorpe S,Fize D,Marlot C.Speed of processing in the human visual system[].Nature.1996

二级参考文献14

  • 1Azuma R, Baillot Y, Behringer R, et al. Recent advances in augmented reality. Comput Graph Appl, 2001, 21:34-47.
  • 2State A, Chen D T, Chris T, et al. Case study: observing a volume-rendered fetus within a pregnant patient. In: Proceeding of IEEE Visualization. Los Alamitos: IEEE Computer Society Press, 1994. 364-368.
  • 3Zaeh M, Vogl W. Interactive laser-projection for programming industrial robots. Manufact Tech, 2008, 57:37-40.
  • 4Stricker D, Daehne P, Seibert F, et al. Design and development issues for an archeoguide: an augmented reality based cultural heritage on-site guide. In: International Conference on Augmented, Virtual Environments and Three- Dimensional Imaging, Mykonos, Greece, 2001.
  • 5Papagiannakis G, Schertenleib S. Mixing virtual and real scenes in the site of ancient Pompeii. J Comput Animat Virtual Worlds, 2005, 16:11-24.
  • 6Julier S, Baillot Y, Lanzagorta M. BARS: batterfield augmented reality system. In: NATO Symposium on Information Processing Techniques for Military Systems. Istanbul: IEEE Computer Society Press, 2000. 9-11.
  • 7Gerhard R, Drummond T. Going out: robust model-based tracking for outdoor augmented reality. In: The Proceeding of Symposium on Augmented Reality. Santa Barbara: IEEE Computer Society Press, 2006. 109-118.
  • 8Simon G, Fitzgibbon A, Zisserman A. Markerless tracking using planar structures in the scene. In: Proc International Symposium on Augmented Reality, Munich, 2000.
  • 9Lowe D G. Distinctive image features from scale-invariant keypoints. Int J Comput Vision, 2004, 60:91-110.
  • 10Lepetit V, Pilet J, Fua P. Point matching as a classification problem for fast and robust object pose estimation. In: Conference on Computer Vision and Pattern Recognition. Washington: IEEE Computer Society Press, 2004. 244-250.

共引文献11

同被引文献47

  • 1曹琼,郑红,李行善.一种基于纹理特征的卫星遥感图像云探测方法[J].航空学报,2007,28(3):661-666. 被引量:32
  • 2Baddeley R J, Tatler B W. High frequency edges (but not contrast) predict where we fixate: a Bayesian system identification analysis. Vision Res, 2006, 46:2824-2833.
  • 3Bartolucci M, Smith A T. Attentional modulation in visual cortex is modified during perceptual learning. Neuropsy- chologia, 2011, 49:3898-3907.
  • 4Braddick O, Atkinson J. Development of human visual function. Vision Res, 2011, 51: 1588-1609.
  • 5Herzog M H, Fahle M. Effects of grouping in contextual modulation. Nature, 2002, 415:433-436.
  • 6Willmore B D B, Bulstrode H, Tolhurst D J. Contrast normalization contributes to a biologically-plausible model of receptive-field development in primary visual cortex (V1). Vision Res, 2012, 54:49-60.
  • 7Grossberg S, Mingolla E, Ross W. Visual brain and visual perception: how does the cortex do perceptual grouping? Trends Neurosci, 1997, 20:106-111.
  • 8Li Z. A neural model of contour integration in the primary visual cortex. Neural Comput, 1998, 10:903-940.
  • 9Grigorescu C, Petkov N, Westenberg M A. Contour detection based on nonclassical receptive field inhibition. IEEE Trans Image Process, 2003, 12:729-739.
  • 10Li C. Integration fields beyond the classical receptive field: organization and functional properties. News Physiol Sci, 1996, 11:181-186.

引证文献4

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部