Title: Deep Multi-modality Soft-decoding of Very Low Bit-rate Face Videos

Aug 02, 2020

Yanhui Guo, Xi Zhang, Xiaolin Wu

Abstract: We propose a novel deep multi-modality neural network for restoring very low bit-rate videos of talking heads. Such video content is very common in social media, teleconferencing, distance education, telemedicine, etc., and often needs to be transmitted with limited bandwidth. The proposed CNN method exploits the correlations among three modalities (video, audio, and the emotional state of the speaker) to remove the video compression artifacts caused by spatial downsampling and quantization. The deep learning approach turns out to be ideally suited for the video restoration task, as the complex non-linear cross-modality correlations are very difficult to model analytically and explicitly. The new method is a video post-processor that can significantly boost the perceptual quality of aggressively compressed talking-head videos, while being fully compatible with all existing video compression standards.
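
The abstract names spatial downsampling and quantization as the two sources of the artifacts that the post-processor has to remove. As a rough illustration of that degradation only (not the authors' pipeline, and not any real codec), the toy sketch below shrinks a tiny grayscale frame by 2x, quantizes it uniformly, and upsamples it again. The frame values, the step size of 32, and all function names are assumptions made for this example; real video codecs quantize transform coefficients rather than raw pixels.

```python
# Toy degradation model: 2x spatial downsampling followed by uniform quantization.
# Illustrative assumption only; it does not reproduce the paper's codec or network.

def downsample_2x(frame):
    """Average each non-overlapping 2x2 block (height and width must be even)."""
    h, w = len(frame), len(frame[0])
    return [
        [
            (frame[y][x] + frame[y][x + 1] + frame[y + 1][x] + frame[y + 1][x + 1]) / 4.0
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


def quantize(frame, step):
    """Uniform quantization: snap each value to the nearest multiple of `step`."""
    return [[step * round(v / step) for v in row] for row in frame]


def upsample_2x(frame):
    """Nearest-neighbour upsampling back to the original resolution."""
    out = []
    for row in frame:
        wide = [v for v in row for _ in range(2)]  # repeat each pixel horizontally
        out.append(wide)
        out.append(list(wide))  # repeat the row vertically
    return out


def mean_abs_error(a, b):
    """Mean absolute difference between two frames of equal size."""
    total = sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))
    return total / (len(a) * len(a[0]))


# Hypothetical 4x4 frame with a sharp vertical edge that is not aligned to the 2x2 grid.
frame = [[10, 200, 200, 200] for _ in range(4)]

decoded = upsample_2x(quantize(downsample_2x(frame), step=32))
error = mean_abs_error(frame, decoded)

for row in decoded:
    print(row)
print("mean absolute error:", error)
```

In this toy case the edge is smeared across the first block and every intensity is shifted to a quantization level. A restoration network such as the one described would take the decoded frame as input, which is why it can sit after any standard decoder as a post-processor.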

* Accepted by Proceedings of the 28th ACM International Conference on Multimedia (ACM MM), 2020
