Abstract:This work introduces Semantically Masked VQ-GAN (SQ-GAN), a novel approach integrating generative models to optimize image compression for semantic/task-oriented communications. SQ-GAN employs off-the-shelf semantic semantic segmentation and a new specifically developed semantic-conditioned adaptive mask module (SAMM) to selectively encode semantically significant features of the images. SQ-GAN outperforms state-of-the-art image compression schemes such as JPEG2000 and BPG across multiple metrics, including perceptual quality and semantic segmentation accuracy on the post-decoding reconstructed image, at extreme low compression rates expressed in bits per pixel.
Abstract:Whenever communication takes place to fulfil a goal, an effective way to encode the source data to be transmitted is to use an encoding rule that allows the receiver to meet the requirements of the goal. A formal way to identify the relevant information with respect to a goal can be obtained exploiting the information bottleneck (IB) principle. In this paper, we propose a goal-oriented communication system, based on the combination of IB and stochastic optimization. The IB principle is used to design the encoder in order to find an optimal balance between representation complexity and relevance of the encoded data with respect to the goal. Stochastic optimization is then used to adapt the parameters of the IB to find an efficient resource allocation of communication and computation resources. Our goal is to minimize the average energy consumption under constraints on average service delay and accuracy of the learning task applied to the received data in a dynamic scenario. Numerical results assess the performance of the proposed strategy in two cases: regression from Gaussian random variables, where we can exploit closed-form solutions, and image classification using deep neural networks, with adaptive network splitting between transmit and receive sides.