Abstract
The rapid growth of multimedia content necessitates powerful technologies to filter, classify, index and retrieve video documents more efficiently. However, the essential bottleneck of image and video analysis is the semantic gap: low-level features extracted by computers often fail to coincide with the high-level concepts interpreted by humans. In this paper, we present a generic scheme for detecting video semantic concepts based on machine learning over multiple visual features. Various global and local low-level visual features are systematically investigated, and kernel-based learning methods equip the concept detection system to exploit the potential of these features. We then combine the different features and sub-systems through both classifier-level and kernel-level fusion, which contributes to a more robust system. The proposed system is tested on the TRECVID dataset. The resulting Mean Average Precision (MAP) score is much better than the benchmark performance, which shows that our concept detection engine learns a generic model and performs well on both object-type and scene-type concepts.
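The abstract names two fusion strategies (kernel-level and classifier-level) and evaluation by Mean Average Precision without implementation details. The following is a minimal, hypothetical sketch of such a pipeline, not the authors' code: it assumes scikit-learn SVMs with precomputed kernels, a chi-square kernel over illustrative histogram features (e.g. color histograms and bag-of-words vectors), and caller-supplied fusion weights.

# Hypothetical sketch of kernel-level vs. classifier-level fusion and MAP scoring.
import numpy as np
from sklearn.svm import SVC
from sklearn.metrics import average_precision_score

def chi2_kernel(X, Y, gamma=1.0):
    """Exponential chi-square kernel, a common choice for histogram features."""
    d = np.array([[np.sum((x - y) ** 2 / (x + y + 1e-10)) for y in Y] for x in X])
    return np.exp(-gamma * d)

def kernel_level_fusion(train_feats, test_feats, y_train, weights):
    """Fuse per-feature kernels into one weighted kernel, then train a single SVM."""
    K_train = sum(w * chi2_kernel(Xtr, Xtr) for w, Xtr in zip(weights, train_feats))
    K_test = sum(w * chi2_kernel(Xte, Xtr)
                 for w, (Xtr, Xte) in zip(weights, zip(train_feats, test_feats)))
    clf = SVC(kernel='precomputed', probability=True).fit(K_train, y_train)
    return clf.predict_proba(K_test)[:, 1]

def classifier_level_fusion(train_feats, test_feats, y_train, weights):
    """Train one SVM per feature and average their concept scores (late fusion)."""
    scores = []
    for Xtr, Xte in zip(train_feats, test_feats):
        clf = SVC(kernel='precomputed', probability=True).fit(chi2_kernel(Xtr, Xtr), y_train)
        scores.append(clf.predict_proba(chi2_kernel(Xte, Xtr))[:, 1])
    return np.average(scores, axis=0, weights=weights)

def mean_average_precision(labels_per_concept, scores_per_concept):
    """Per-concept average precision; TRECVID MAP is the mean over all concepts."""
    return np.mean([average_precision_score(y, s)
                    for y, s in zip(labels_per_concept, scores_per_concept)])

Kernel-level fusion trains one classifier on a combined similarity matrix, while classifier-level fusion averages the outputs of independently trained classifiers; which weights to use per concept is a design choice the sketch leaves to the caller.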
Funding
This paper was supported by the collaborative Research Project SEV under Grant No. 01100474 between Beijing University of Posts and Telecommunications and France Telecom R&D Beijing; the National Natural Science Foundation of China under Grant No. 90920001; and the Graduate Innovation Fund of SICE, BUPT, 2011.
About the Authors
Dong Yuan, associate professor at Beijing University of Posts and Telecommunications, China, and a senior research consultant in multimedia indexing research at France Telecom Research & Development Beijing. He received his Ph.D. degree from Shanghai Jiao Tong University and worked as a postdoctoral research staff member at the Engineering Department, Cambridge University, UK. He is now working on the European speech recognition project CORtEX. His current research interests include semantic video indexing, video copy detection, and multimedia content search. Email: yuandong@bupt.edu.cn

Zhang Jiwei, postgraduate student at the Pattern Recognition Lab, Beijing University of Posts and Telecommunications, China. His current research interests include visual concept detection and sports categorization. Email: buptjiwei@gmail.com

Zhao Nan, postgraduate student at the Pattern Recognition Lab, Beijing University of Posts and Telecommunications, China. Her current research interests include visual concept detection and sports categorization. Email: zhao.nan07@gmail.com

Chang Xiaofu, researcher in Multimedia Analysis and Retrieval, France Telecom Research & Development Beijing, China. His research interests include image/video search, object recognition, and data mining. Email: xiaofu.chang@orange.com

Liu Wei, researcher in Multimedia Analysis and Retrieval, France Telecom Research & Development Beijing, China. His current research interests include video and image copy detection, and face detection. Email: wei.liu@orange.com