期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Efficient fast mode decision using mode complexity for multi-view video coding 被引量:1
1
作者 王凤随 沈庆宏 都思丹 《Journal of Central South University》 SCIE EI CAS 2014年第11期4244-4253,共10页
The variable block-size motion estimation(ME) and disparity estimation(DE) are adopted in multi-view video coding(MVC) to achieve high coding efficiency. However, much higher computational complexity is also introduce... The variable block-size motion estimation(ME) and disparity estimation(DE) are adopted in multi-view video coding(MVC) to achieve high coding efficiency. However, much higher computational complexity is also introduced in coding system, which hinders practical application of MVC. An efficient fast mode decision method using mode complexity is proposed to reduce the computational complexity. In the proposed method, mode complexity is firstly computed by using the spatial, temporal and inter-view correlation between the current macroblock(MB) and its neighboring MBs. Based on the observation that direct mode is highly possible to be the optimal mode, mode complexity is always checked in advance whether it is below a predefined threshold for providing an efficient early termination opportunity. If this early termination condition is not met, three mode types for the MBs are classified according to the value of mode complexity, i.e., simple mode, medium mode and complex mode, to speed up the encoding process by reducing the number of the variable block modes required to be checked. Furthermore, for simple and medium mode region, the rate distortion(RD) cost of mode 16×16 in the temporal prediction direction is compared with that of the disparity prediction direction, to determine in advance whether the optimal prediction direction is in the temporal prediction direction or not, for skipping unnecessary disparity estimation. Experimental results show that the proposed method is able to significantly reduce the computational load by 78.79% and the total bit rate by 0.07% on average, while only incurring a negligible loss of PSNR(about 0.04 d B on average), compared with the full mode decision(FMD) in the reference software of MVC. 展开更多
关键词 multi-view video coding mode decision mode complexity computational complexity
在线阅读 下载PDF
面向VVC的QP自适应环路滤波器
2
作者 刘鹏宇 金鹏程 《北京工业大学学报》 北大核心 2025年第10期1171-1178,共8页
现有的基于卷积神经网络(convolutional neural network,CNN)的环路滤波器倾向于将多个网络应用于不同的量化参数(quantization parameter,QP),消耗训练模型中的大量资源,并增加内存负担。针对这一问题,提出一种基于CNN的QP自适应环路... 现有的基于卷积神经网络(convolutional neural network,CNN)的环路滤波器倾向于将多个网络应用于不同的量化参数(quantization parameter,QP),消耗训练模型中的大量资源,并增加内存负担。针对这一问题,提出一种基于CNN的QP自适应环路滤波器。首先,设计一个轻量级分类网络,按照滤波难易程度将编码树单元(coding tree unit,CTU)划分为难、中、易3类;然后,构建3个融合了特征信息增强融合模块的基于CNN的滤波网络,以满足不同QP下的3类CTU滤波需求。将所提出的环路滤波器集成到多功能视频编码(versatile video coding,VVC)标准H.266/VVC的测试软件VTM 6.0中,替换原有的去块效应滤波器(deblocking filter,DBF)、样本自适应偏移(sample adaptive offset,SAO)滤波器和自适应环路滤波器。实验结果表明,该方法平均降低了3.14%的比特率差值(Bjøntegaard delta bit rate,BD-BR),与其他基于CNN的环路滤波器相比,显著提高了压缩效率,并减少了压缩伪影。 展开更多
关键词 视频编码 多功能视频编码(versatile video coding VVC)标准 环路滤波 卷积神经网络(convolutional neural network CNN) 深度学习 图像去噪
在线阅读 下载PDF
基于注意力-残差双特征流卷积神经网络的深度图帧内编码单元快速划分算法
3
作者 贾克斌 吴岳珩 《北京工业大学学报》 北大核心 2025年第5期539-551,共13页
针对三维高效视频编码(three-dimensional high efficiency video coding,3D-HEVC)深度图编码单元(coding unit,CU)划分复杂度高的问题,提出一种基于卷积神经网络(convolutional neural networks,CNN)的算法来实现快速深度图帧内编码。... 针对三维高效视频编码(three-dimensional high efficiency video coding,3D-HEVC)深度图编码单元(coding unit,CU)划分复杂度高的问题,提出一种基于卷积神经网络(convolutional neural networks,CNN)的算法来实现快速深度图帧内编码。首先,提出一种具有3个分支的注意力-残差双特征流卷积神经网络(attention-residual bi-feature stream convolutional neural networks,ARBS-CNN)模型,其中基于残差模块(residual module,RM)和特征蒸馏(feature distill,FD)模块的2个分支用于提取全局图像特征,基于动态模块(dynamic module,DM)和卷积-卷积块注意力模块(convolutional-convolutional block attention module,Conv-CBAM)的分支用于提取局部图像特征;然后,将提取到的特征进行整合并输出,得到对深度图CU划分结构的预测;最后,将ARBS-CNN嵌入到3D-HEVC测试平台中,利用预测结果加速深度图帧内编码。与原始算法相比,提出的算法能在维持率失真性能几乎不受影响的条件下,平均减少74.2%的编码时间。实验结果表明,该算法能够在保持率失真性能的条件下,有效降低3D-HEVC的编码复杂度。 展开更多
关键词 三维高效视频编码(three-dimensional high efficiency video coding 3D-HEVC) 深度图 卷积神经网络(convolutional neural networks CNN) 编码单元(coding unit CU)划分 帧内编码 双特征流
在线阅读 下载PDF
基于块编码特点的压缩视频质量增强算法 被引量:1
4
作者 于海 杨磊 +4 位作者 高阳 刘枫琪 刘鹏宇 孙萱 张悦 《北京工业大学学报》 CAS CSCD 北大核心 2024年第9期1069-1076,共8页
针对现有压缩视频质量增强算法未能充分利用压缩视频特点的问题,研究了视频编码与压缩视频质量增强任务之间的本质关系,并针对性地设计了一种基于三维卷积神经网络(3D convolutional neural network, 3D-CNN)的非对齐压缩视频质量增强... 针对现有压缩视频质量增强算法未能充分利用压缩视频特点的问题,研究了视频编码与压缩视频质量增强任务之间的本质关系,并针对性地设计了一种基于三维卷积神经网络(3D convolutional neural network, 3D-CNN)的非对齐压缩视频质量增强算法。实验结果表明:相较于高效视频编码(high efficiency video coding, HEVC)标准H.265,所提算法在低延迟(low delay, LD)配置下且量化参数(quantization parameter, QP)为37时,峰值信噪比(peak signal-to-noise ratio, PSNR)提升了0.465 2 dB;相较于数据压缩会议(data compression conference, DCC)中提出的多帧引导的注意力网络(multi-frame guided attention network, MGANet)方法,该算法PSNR的增长量提升了15.1%。 展开更多
关键词 视频编码 高效视频编码(high efficiency video coding HEVC) 压缩视频质量增强 深度学习 卷积神经网络(convolutional neural network CNN) 三维卷积神经网络(3D convolutional neural network 3D-CNN)
在线阅读 下载PDF
Genetic-optimization framework for SVC transmission based on partial cooperative communication
5
作者 Kai Zhao Yongcheng Sun 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2017年第5期861-870,共10页
A genetic-optimization framework based on the partial cooperation communication protocol is proposed for scalable video coding (SVC) stream transmission under multi-relay amplify and forward cooperative networks. Unli... A genetic-optimization framework based on the partial cooperation communication protocol is proposed for scalable video coding (SVC) stream transmission under multi-relay amplify and forward cooperative networks. Unlike traditional cooperative transmission schemes, the transmission mode for each coded unit in this new protocol can be switched flexibly between direct transmission and cooperative transmission. Obviously, under this protocol, the bandwidth efficiency and transmission robustness can be balanced adaptively according to the priority level of coded units and wireless channel fading characteristics. Based on this, a well-known genetic optimization algorithm-differential evolution is exploited here to find the jointly optimal transmission modes, power allocation and unequal error protection (UEP) channel coding strategies to minimize the end to end reconstructed video distortion. Extensive simulation results show that, compared with classical optimal cooperative UEP transmission schemes, the proposed optimized transmission framework based on the partial cooperative protocol can bring significant peak-signal-to-noise-ratio (PSNR) gains for the reconstructed video in a variety of channel bandwidth, power budget and test sequences. 展开更多
关键词 scalable video coding partial cooperative communication unequal error protection source-relay power allocation differential evolution
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部