Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rick Fritschek

MinGRU-Based Encoder for Turbo Autoencoder Frameworks

Mar 11, 2025

Rick Fritschek, Rafael F. Schaefer

Abstract:Early neural channel coding approaches leveraged dense neural networks with one-hot encodings to design adaptive encoder-decoder pairs, improving block error rate (BLER) and automating the design process. However, these methods struggled with scalability as the size of message sets and block lengths increased. TurboAE addressed this challenge by focusing on bit-sequence inputs rather than symbol-level representations, transforming the scalability issue associated with large message sets into a sequence modeling problem. While recurrent neural networks (RNNs) were a natural fit for sequence processing, their reliance on sequential computations made them computationally expensive and inefficient for long sequences. As a result, TurboAE adopted convolutional network blocks, which were faster to train and more scalable, but lacked the sequential modeling advantages of RNNs. Recent advances in efficient RNN architectures, such as minGRU and minLSTM, and structured state space models (SSMs) like S4 and S6, overcome these limitations by significantly reducing memory and computational overhead. These models enable scalable sequence processing, making RNNs competitive for long-sequence tasks. In this work, we revisit RNNs for Turbo autoencoders by integrating the lightweight minGRU model with a Mamba block from SSMs into a parallel Turbo autoencoder framework. Our results demonstrate that this hybrid design matches the performance of convolutional network-based Turbo autoencoder approaches for short sequences while significantly improving scalability and training efficiency for long block lengths. This highlights the potential of efficient RNNs in advancing neural channel coding for long-sequence scenarios.

* 6 pages, accepted at ICMLCN25

Via

Access Paper or Ask Questions

Learning End-to-End Channel Coding with Diffusion Models

Sep 21, 2023

Muah Kim, Rick Fritschek, Rafael F. Schaefer

Abstract:The training of neural encoders via deep learning necessitates a differentiable channel model due to the backpropagation algorithm. This requirement can be sidestepped by approximating either the channel distribution or its gradient through pilot signals in real-world scenarios. The initial approach draws upon the latest advancements in image generation, utilizing generative adversarial networks (GANs) or their enhanced variants to generate channel distributions. In this paper, we address this channel approximation challenge with diffusion models, which have demonstrated high sample quality in image generation. We offer an end-to-end channel coding framework underpinned by diffusion models and propose an efficient training algorithm. Our simulations with various channel models establish that our diffusion models learn the channel distribution accurately, thereby achieving near-optimal end-to-end symbol error rates (SERs). We also note a significant advantage of diffusion models: A robust generalization capability in high signal-to-noise ratio regions, in contrast to GAN variants that suffer from error floor. Furthermore, we examine the trade-off between sample quality and sampling speed, when an accelerated sampling algorithm is deployed, and investigate the effect of the noise scheduling on this trade-off. With an apt choice of noise scheduling, sampling time can be significantly reduced with a minor increase in SER.

Via

Access Paper or Ask Questions

Concatenated Classic and Neural (CCN) Codes: ConcatenatedAE

Sep 04, 2022

Onur Günlü, Rick Fritschek, Rafael F. Schaefer

Figure 1 for Concatenated Classic and Neural (CCN) Codes: ConcatenatedAE

Figure 2 for Concatenated Classic and Neural (CCN) Codes: ConcatenatedAE

Figure 3 for Concatenated Classic and Neural (CCN) Codes: ConcatenatedAE

Figure 4 for Concatenated Classic and Neural (CCN) Codes: ConcatenatedAE

Abstract:Small neural networks (NNs) used for error correction were shown to improve on classic channel codes and to address channel model changes. We extend the code dimension of any such structure by using the same NN under one-hot encoding multiple times, which are serially-concatenated with an outer classic code. We design NNs with the same network parameters, where each Reed-Solomon codeword symbol is an input to a different NN. Significant improvements in block error probabilities for an additive Gaussian noise channel as compared to the small neural code are illustrated, as well as robustness to channel model changes.

* 6 pages

Via

Access Paper or Ask Questions

A Reverse Jensen Inequality Result with Application to Mutual Information Estimation

Nov 12, 2021

Gerhard Wunder, Benedikt Groß, Rick Fritschek, Rafael F. Schaefer

Figure 1 for A Reverse Jensen Inequality Result with Application to Mutual Information Estimation

Figure 2 for A Reverse Jensen Inequality Result with Application to Mutual Information Estimation

Abstract:The Jensen inequality is a widely used tool in a multitude of fields, such as for example information theory and machine learning. It can be also used to derive other standard inequalities such as the inequality of arithmetic and geometric means or the H\"older inequality. In a probabilistic setting, the Jensen inequality describes the relationship between a convex function and the expected value. In this work, we want to look at the probabilistic setting from the reverse direction of the inequality. We show that under minimal constraints and with a proper scaling, the Jensen inequality can be reversed. We believe that the resulting tool can be helpful for many applications and provide a variational estimation of mutual information, where the reverse inequality leads to a new estimator with superior training behavior compared to current estimators.

* 6 pages, ITW 2021

Via

Access Paper or Ask Questions

Reinforce Security: A Model-Free Approach Towards Secure Wiretap Coding

Jun 01, 2021

Rick Fritschek, Rafael F. Schaefer, Gerhard Wunder

Figure 1 for Reinforce Security: A Model-Free Approach Towards Secure Wiretap Coding

Figure 2 for Reinforce Security: A Model-Free Approach Towards Secure Wiretap Coding

Figure 3 for Reinforce Security: A Model-Free Approach Towards Secure Wiretap Coding

Abstract:The use of deep learning-based techniques for approximating secure encoding functions has attracted considerable interest in wireless communications due to impressive results obtained for general coding and decoding tasks for wireless communication systems. Of particular importance is the development of model-free techniques that work without knowledge about the underlying channel. Such techniques utilize for example generative adversarial networks to estimate and model the conditional channel distribution, mutual information estimation as a reward function, or reinforcement learning. In this paper, the approach of reinforcement learning is studied and, in particular, the policy gradient method for a model-free approach of neural network-based secure encoding is investigated. Previously developed techniques for enforcing a certain co-set structure on the encoding process can be combined with recent reinforcement learning approaches. This new approach is evaluated by extensive simulations, and it is demonstrated that the resulting decoding performance of an eavesdropper is capped at a certain error level.

* Accepted for ICC 2021, 6 pages

Via

Access Paper or Ask Questions

Deep Learning for Channel Coding via Neural Mutual Information Estimation

Mar 07, 2019

Rick Fritschek, Rafael F. Schaefer, Gerhard Wunder

Figure 1 for Deep Learning for Channel Coding via Neural Mutual Information Estimation

Figure 2 for Deep Learning for Channel Coding via Neural Mutual Information Estimation

Figure 3 for Deep Learning for Channel Coding via Neural Mutual Information Estimation

Figure 4 for Deep Learning for Channel Coding via Neural Mutual Information Estimation

Abstract:End-to-end deep learning for communication systems, i.e., systems whose encoder and decoder are learned, has attracted significant interest recently, due to its performance which comes close to well-developed classical encoder-decoder designs. However, one of the drawbacks of current learning approaches is that a differentiable channel model is needed for the training of the underlying neural networks. In real-world scenarios, such a channel model is hardly available and often the channel density is not even known at all. Some works, therefore, focus on a generative approach, i.e., generating the channel from samples, or rely on reinforcement learning to circumvent this problem. We present a novel approach which utilizes a recently proposed neural estimator of mutual information. We use this estimator to optimize the encoder for a maximized mutual information, only relying on channel samples. Moreover, we show that our approach achieves the same performance as state-of-the-art end-to-end learning with perfect channel model knowledge.

* 5 pages, 6 figures

Via

Access Paper or Ask Questions