Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thomas Schierl

Compression of 3D Gaussian Splatting with Optimized Feature Planes and Standard Video Codecs

Jan 06, 2025

Soonbin Lee, Fangwen Shu, Yago Sanchez, Thomas Schierl, Cornelius Hellge

Abstract:3D Gaussian Splatting is a recognized method for 3D scene representation, known for its high rendering quality and speed. However, its substantial data requirements present challenges for practical applications. In this paper, we introduce an efficient compression technique that significantly reduces storage overhead by using compact representation. We propose a unified architecture that combines point cloud data and feature planes through a progressive tri-plane structure. Our method utilizes 2D feature planes, enabling continuous spatial representation. To further optimize these representations, we incorporate entropy modeling in the frequency domain, specifically designed for standard video codecs. We also propose channel-wise bit allocation to achieve a better trade-off between bitrate consumption and feature plane representation. Consequently, our model effectively leverages spatial correlations within the feature planes to enhance rate-distortion performance using standard, non-differentiable video codecs. Experimental results demonstrate that our method outperforms existing methods in data compactness while maintaining high rendering quality. Our project page is available at https://fraunhoferhhi.github.io/CodecGS

Via

Access Paper or Ask Questions

ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain Optimization

Nov 23, 2023

Soonbin Lee, Fangwen Shu, Yago Sanchez, Thomas Schierl, Cornelius Hellge

Abstract:Explicit feature-grid based NeRF models have shown promising results in terms of rendering quality and significant speed-up in training. However, these methods often require a significant amount of data to represent a single scene or object. In this work, we present a compression model that aims to minimize the entropy in the frequency domain in order to effectively reduce the data size. First, we propose using the discrete cosine transform (DCT) on the tensorial radiance fields to compress the feature-grid. This feature-grid is transformed into coefficients, which are then quantized and entropy encoded, following a similar approach to the traditional video coding pipeline. Furthermore, to achieve a higher level of sparsity, we propose using an entropy parameterization technique for the frequency domain, specifically for DCT coefficients of the feature-grid. Since the transformed coefficients are optimized during the training phase, the proposed model does not require any fine-tuning or additional information. Our model only requires a lightweight compression pipeline for encoding and decoding, making it easier to apply volumetric radiance field methods for real-world applications. Experimental results demonstrate that our proposed frequency domain entropy model can achieve superior compression performance across various datasets. The source code will be made publicly available.

* 10 pages, 6 figures, 4 tables

Via

Access Paper or Ask Questions

Enabling sub-THz Cloud RANs: Distributed Machine-Learning for Early HARQ Feedback Prediction

Feb 18, 2022

Barış Göktepe, Cornelius Hellge, Thomas Schierl, Slawomir Stanczak

Figure 1 for Enabling sub-THz Cloud RANs: Distributed Machine-Learning for Early HARQ Feedback Prediction

Figure 2 for Enabling sub-THz Cloud RANs: Distributed Machine-Learning for Early HARQ Feedback Prediction

Figure 3 for Enabling sub-THz Cloud RANs: Distributed Machine-Learning for Early HARQ Feedback Prediction

Figure 4 for Enabling sub-THz Cloud RANs: Distributed Machine-Learning for Early HARQ Feedback Prediction

Abstract:We propose novel HARQ prediction schemes for Cloud RANs (C-RANs) that use feedback over a rate-limited feedback channel (4 and 8 bits) from the Remote Radio Heads (RRHs) to predict at the User Equipment (UE) the decoding outcome at the BaseBand Unit (BBU) ahead of actual decoding. In particular, we propose a novel dual-input denoising autoencoder that is trained in a joint end-to-end fashion over the whole C-RAN setup. In realistic link-level simulations at 100 GHz in the sub-THz band, we show that a combination of the novel dual-input denoising autoencoder and state-of-the-art SNR-based HARQ feedback prediction achieves the overall best performance in all scenarios compared to other proposed and state-of-the-art single prediction schemes. At very low target error rates down to $1.6 \cdot 10^{-5}$, this combined approach reduces the number of required transmission rounds by up to 50\% compared to always transmitting all redundancy.

Via

Access Paper or Ask Questions

Open GOP Resolution Switching in HTTP Adaptive Streaming with VVC

Mar 11, 2021

Robert Skupin, Christian Bartnik, Adam Wieckowski, Yago Sanchez, Benjamin Bross, Cornelius Hellge, Thomas Schierl

Figure 1 for Open GOP Resolution Switching in HTTP Adaptive Streaming with VVC

Figure 2 for Open GOP Resolution Switching in HTTP Adaptive Streaming with VVC

Figure 3 for Open GOP Resolution Switching in HTTP Adaptive Streaming with VVC

Figure 4 for Open GOP Resolution Switching in HTTP Adaptive Streaming with VVC

Abstract:The user experience in adaptive HTTP streaming relies on offering bitrate ladders with suitable operation points for all users and typically involves multiple resolutions. While open GOP coding structures are generally known to provide substantial coding efficiency benefit, their use in HTTP streaming has been precluded through lacking support of reference picture resampling (RPR) in AVC and HEVC. The newly emerging Versatile Video Coding (VVC) standard supports RPR, but only conversational scenarios were primarily investigated during the design of VVC. This paper aims at enabling usage of RPR in HTTP streaming scenarios through analysing the drift potential of VVC coding tools and presenting a constrained encoding method that avoids severe drift artefacts in resolution switching with open GOP coding in VVC. In typical live streaming configurations, the presented method achieves up to -8.7% BD-rate reduction compared to closed GOP coding while in a typical Video on Demand configuration, up to -2.4% BD-rate reduction is reported. The constraints penalty compared to regular open GOP coding is 0.53% BD-rate in the worst case. The presented method will be integrated into the publicly available open source VVC encoder VVenC v0.3.

* Submitted to IEEE PCS 2021, 5 pages, 3 figures

Via

Access Paper or Ask Questions

Enhanced Machine Learning Techniques for Early HARQ Feedback Prediction in 5G

Jul 27, 2018

Nils Strodthoff, Barış Göktepe, Thomas Schierl, Cornelius Hellge, Wojciech Samek

Figure 1 for Enhanced Machine Learning Techniques for Early HARQ Feedback Prediction in 5G

Figure 2 for Enhanced Machine Learning Techniques for Early HARQ Feedback Prediction in 5G

Figure 3 for Enhanced Machine Learning Techniques for Early HARQ Feedback Prediction in 5G

Figure 4 for Enhanced Machine Learning Techniques for Early HARQ Feedback Prediction in 5G

Abstract:We investigate Early Hybrid Automatic Repeat reQuest (E-HARQ) feedback schemes enhanced by Machine Learning techniques as possible path towards ultra-reliable and low-latency communication (URLLC). To this end we propose Machine Learning methods to predict the outcome of the decoding process ahead of the end of the transmission. We discuss different input features and classification algorithms ranging from traditional methods to newly developed supervised autoencoders and their prospects of reaching effective block error rates of $10^{-5}$ that are required for URLLC with only small latency overhead. We provide realistic performance estimates in a system model incorporating scheduling effects to demonstrate the feasibility of E-HARQ across different signal-to-noise ratios, subcode lengths, channel conditions and system loads.

* 14 page, 8 figures

Via

Access Paper or Ask Questions