Title: Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

Jun 26, 2021

Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

Abstract: Video represents the majority of internet traffic today, leading to a continuous technological arms race between generating higher-quality content, transmitting larger file sizes, and supporting network infrastructure. Adding to this is the recent COVID-19 pandemic-fueled surge in the use of video conferencing tools. Since videos take up substantial bandwidth (~100 Kbps to a few Mbps), improved video compression can have a substantial impact on network performance for live and pre-recorded content, providing broader access to multimedia content worldwide. In this work, we present a novel video compression pipeline, called Txt2Vid, which substantially reduces data transmission rates by compressing webcam videos ("talking-head videos") to a text transcript. The text is transmitted and decoded into a realistic reconstruction of the original video using recent advances in deep-learning-based voice cloning and lip syncing models. Our generative pipeline achieves a two to three orders of magnitude reduction in bitrate compared to standard audio-video codecs (encoders-decoders), while maintaining equivalent Quality-of-Experience based on a subjective evaluation by users (n=242) in an online study. The code for this work is available at https://github.com/tpulkit/txt2vid.git.

* 8 pages, 5 figures, 1 table
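
To give a rough sense of why sending a transcript instead of video can cut the bitrate so sharply, the following back-of-the-envelope sketch estimates the bitrate of plain text for speech and compares it with the low end of the video bandwidth range quoted in the abstract (~100 Kbps). The speaking rate (150 words per minute), average word length (6 characters including the space), and 8 bits per character are illustrative assumptions, not values from the paper, and the function names are hypothetical. The result is not the paper's measured figure, which comes from comparing against standard codecs at equivalent Quality-of-Experience.

```python
import math


def text_bitrate_bps(words_per_minute=150.0, chars_per_word=6.0, bits_per_char=8.0):
    """Estimate the bitrate of an uncompressed speech transcript.

    All three defaults are illustrative assumptions, not values from the paper.
    """
    return words_per_minute * chars_per_word * bits_per_char / 60.0


def reduction_factor(codec_bps, text_bps):
    """Ratio of a codec's bitrate to the transcript bitrate."""
    return codec_bps / text_bps


def orders_of_magnitude(codec_bps, text_bps):
    """Base-10 logarithm of the reduction factor."""
    return math.log10(reduction_factor(codec_bps, text_bps))


if __name__ == "__main__":
    text_bps = text_bitrate_bps()  # 150 * 6 * 8 / 60 = 120 bits per second
    codec_bps = 100_000            # ~100 Kbps, low end of the range in the abstract
    print(f"transcript bitrate: {text_bps:.0f} bps")
    print(f"reduction factor:   {reduction_factor(codec_bps, text_bps):.0f}x")
    print(f"orders of magnitude: {orders_of_magnitude(codec_bps, text_bps):.2f}")
```

Under these assumptions the transcript needs on the order of a hundred bits per second, so the gap to a ~100 Kbps video stream is a little under three orders of magnitude. The estimate shifts with the assumed speaking rate, text encoding, and the codec bitrate used as the baseline.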
