Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhao Zan

Fast QTMT Partition for VVC Intra Coding Using U-Net Framework

Apr 06, 2023

Zhao Zan, Leilei Huang, ShuShi Chen, Xiantao Zhang, Zhenghui Zhao, Haibing Yin, Yibo Fan

Abstract:Versatile Video Coding (VVC) has significantly increased encoding efficiency at the expense of numerous complex coding tools, particularly the flexible Quad-Tree plus Multi-type Tree (QTMT) block partition. This paper proposes a deep learning-based algorithm applied in fast QTMT partition for VVC intra coding. Our solution greatly reduces encoding time by early termination of less-likely intra prediction and partitions with negligible BD-BR increase. Firstly, a redesigned U-Net is recommended as the network's fundamental framework. Next, we design a Quality Parameter (QP) fusion network to regulate the effect of QPs on the partition results. Finally, we adopt a refined post-processing strategy to better balance encoding performance and complexity. Experimental results demonstrate that our solution outperforms the state-of-the-art works with a complexity reduction of 44.74% to 68.76% and a BD-BR increase of 0.60% to 2.33%.

Via

Access Paper or Ask Questions

Learned Image Compression with Separate Hyperprior Decoders

Oct 31, 2021

Zhao Zan, Chao Liu, Heming Sun, Xiaoyang Zeng, Yibo Fan

Figure 1 for Learned Image Compression with Separate Hyperprior Decoders

Figure 2 for Learned Image Compression with Separate Hyperprior Decoders

Figure 3 for Learned Image Compression with Separate Hyperprior Decoders

Figure 4 for Learned Image Compression with Separate Hyperprior Decoders

Abstract:Learned image compression techniques have achieved considerable development in recent years. In this paper, we find that the performance bottleneck lies in the use of a single hyperprior decoder, in which case the ternary Gaussian model collapses to a binary one. To solve this, we propose to use three hyperprior decoders to separate the decoding process of the mixed parameters in discrete Gaussian mixture likelihoods, achieving more accurate parameters estimation. Experimental results demonstrate the proposed method optimized by MS-SSIM achieves on average 3.36% BD-rate reduction compared with state-of-the-art approach. The contribution of the proposed method to the coding time and FLOPs is negligible.

* This paper has been accepted by IEEE Open Journal of Circuits and Systems

Via

Access Paper or Ask Questions