行为识别在语义分析领域具有很高的学术研究价值和广泛的市场应用前景.为了实现对视频行为的准确描述,提出了2类构建稠密轨迹运动描述子的方法.1)通过光流约束和聚类,实现对运动区域的稠密采样,以获取行为的局部位置信息;2)选取目标运...行为识别在语义分析领域具有很高的学术研究价值和广泛的市场应用前景.为了实现对视频行为的准确描述,提出了2类构建稠密轨迹运动描述子的方法.1)通过光流约束和聚类,实现对运动区域的稠密采样,以获取行为的局部位置信息;2)选取目标运动角点为特征点,通过对特征点的跟踪获取运动轨迹;3)在以轨迹为中心的视频立方体内,分别构建三维梯度方向直方图(3Dhistograms of oriented gradients in trajectory centered cube,3DHOGTCC)描述子和三维光流梯度方向直方图(3Dhistograms of oriented optical flow gradients,3DHOOFG)描述子,用以对运动的局部信息进行准确描述.为了充分利用行为发生的场景信息,提出了一种融合动态描述子和静态描述子的行为识别新框架,使得动态特征与静态特征相互融合支撑,即使在摄像头运动等复杂场景下,亦能取得较好的识别效果.在Weizmann和UCF-Sports数据库采用留一交叉验证,在KTH和Youtube数据库采用4折交叉验证.实验证明了提出新框架的有效性.展开更多
Most research on anomaly detection has focused on event that is different from its spatial-temporal neighboring events.It is still a significant challenge to detect anomalies that involve multiple normal events intera...Most research on anomaly detection has focused on event that is different from its spatial-temporal neighboring events.It is still a significant challenge to detect anomalies that involve multiple normal events interacting in an unusual pattern.In this work,a novel unsupervised method based on sparse topic model was proposed to capture motion patterns and detect anomalies in traffic surveillance.scale-invariant feature transform(SIFT)flow was used to improve the dense trajectory in order to extract interest points and the corresponding descriptors with less interference.For the purpose of strengthening the relationship of interest points on the same trajectory,the fisher kernel method was applied to obtain the representation of trajectory which was quantized into visual word.Then the sparse topic model was proposed to explore the latent motion patterns and achieve a sparse representation for the video scene.Finally,two anomaly detection algorithms were compared based on video clip detection and visual word analysis respectively.Experiments were conducted on QMUL Junction dataset and AVSS dataset.The results demonstrated the superior efficiency of the proposed method.展开更多
A new method for interaction recognition based on sparse representation of feature covariance matrices was presented.Firstly,the dense trajectories(DT)extracted from the video were clustered into different groups to e...A new method for interaction recognition based on sparse representation of feature covariance matrices was presented.Firstly,the dense trajectories(DT)extracted from the video were clustered into different groups to eliminate the irrelevant trajectories,which could greatly reduce the noise influence on feature extraction.Then,the trajectory tunnels were characterized by means of feature covariance matrices.In this way,the discriminative descriptors could be extracted,which was also an effective solution to the problem that the description of the feature second-order statistics is insufficient.After that,an over-complete dictionary was learned with the descriptors and all the descriptors were encoded using sparse coding(SC).Classification was achieved using multiple instance learning(MIL),which was more suitable for complex environments.The proposed method was tested and evaluated on the WEB Interaction dataset and the UT interaction dataset.The experimental results demonstrated the superior efficiency.展开更多
文摘行为识别在语义分析领域具有很高的学术研究价值和广泛的市场应用前景.为了实现对视频行为的准确描述,提出了2类构建稠密轨迹运动描述子的方法.1)通过光流约束和聚类,实现对运动区域的稠密采样,以获取行为的局部位置信息;2)选取目标运动角点为特征点,通过对特征点的跟踪获取运动轨迹;3)在以轨迹为中心的视频立方体内,分别构建三维梯度方向直方图(3Dhistograms of oriented gradients in trajectory centered cube,3DHOGTCC)描述子和三维光流梯度方向直方图(3Dhistograms of oriented optical flow gradients,3DHOOFG)描述子,用以对运动的局部信息进行准确描述.为了充分利用行为发生的场景信息,提出了一种融合动态描述子和静态描述子的行为识别新框架,使得动态特征与静态特征相互融合支撑,即使在摄像头运动等复杂场景下,亦能取得较好的识别效果.在Weizmann和UCF-Sports数据库采用留一交叉验证,在KTH和Youtube数据库采用4折交叉验证.实验证明了提出新框架的有效性.
基金国家自然科学基金(6117212761401001)+4 种基金高等学校博士学科点专项科研基金(20113401110006)安徽省自然科学基金(1508085MF120)资助Supported by National Natural Science Foundation of China(6117212761401001)Specialized Research Fund for the Doctoral Program of Higher Education of China(20113401110006)and Anhui Provincial Natural Science Foundation(1508085MF120)
基金Project(50808025)supported by the National Natural Science Foundation of ChinaProject(20090162110057)supported by the Doctoral Fund of Ministry of Education,China
文摘Most research on anomaly detection has focused on event that is different from its spatial-temporal neighboring events.It is still a significant challenge to detect anomalies that involve multiple normal events interacting in an unusual pattern.In this work,a novel unsupervised method based on sparse topic model was proposed to capture motion patterns and detect anomalies in traffic surveillance.scale-invariant feature transform(SIFT)flow was used to improve the dense trajectory in order to extract interest points and the corresponding descriptors with less interference.For the purpose of strengthening the relationship of interest points on the same trajectory,the fisher kernel method was applied to obtain the representation of trajectory which was quantized into visual word.Then the sparse topic model was proposed to explore the latent motion patterns and achieve a sparse representation for the video scene.Finally,two anomaly detection algorithms were compared based on video clip detection and visual word analysis respectively.Experiments were conducted on QMUL Junction dataset and AVSS dataset.The results demonstrated the superior efficiency of the proposed method.
基金Project(51678075) supported by the National Natural Science Foundation of ChinaProject(2017GK2271) supported by the Science and Technology Project of Hunan Province,China
文摘A new method for interaction recognition based on sparse representation of feature covariance matrices was presented.Firstly,the dense trajectories(DT)extracted from the video were clustered into different groups to eliminate the irrelevant trajectories,which could greatly reduce the noise influence on feature extraction.Then,the trajectory tunnels were characterized by means of feature covariance matrices.In this way,the discriminative descriptors could be extracted,which was also an effective solution to the problem that the description of the feature second-order statistics is insufficient.After that,an over-complete dictionary was learned with the descriptors and all the descriptors were encoded using sparse coding(SC).Classification was achieved using multiple instance learning(MIL),which was more suitable for complex environments.The proposed method was tested and evaluated on the WEB Interaction dataset and the UT interaction dataset.The experimental results demonstrated the superior efficiency.