Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Language-Oriented Semantic Latent Representation for Image Transmission

May 16, 2024

Giordano Cicchetti, Eleonora Grassucci, Jihong Park, Jinho Choi, Sergio Barbarossa, Danilo Comminiello

Figure 1 for Language-Oriented Semantic Latent Representation for Image Transmission

Figure 2 for Language-Oriented Semantic Latent Representation for Image Transmission

Figure 3 for Language-Oriented Semantic Latent Representation for Image Transmission

Figure 4 for Language-Oriented Semantic Latent Representation for Image Transmission

Share this with someone who'll enjoy it:

Abstract:In the new paradigm of semantic communication (SC), the focus is on delivering meanings behind bits by extracting semantic information from raw data. Recent advances in data-to-text models facilitate language-oriented SC, particularly for text-transformed image communication via image-to-text (I2T) encoding and text-to-image (T2I) decoding. However, although semantically aligned, the text is too coarse to precisely capture sophisticated visual features such as spatial locations, color, and texture, incurring a significant perceptual difference between intended and reconstructed images. To address this limitation, in this paper, we propose a novel language-oriented SC framework that communicates both text and a compressed image embedding and combines them using a latent diffusion model to reconstruct the intended image. Experimental results validate the potential of our approach, which transmits only 2.09\% of the original image size while achieving higher perceptual similarities in noisy communication channels compared to a baseline SC method that communicates only through text.The code is available at https://github.com/ispamm/Img2Img-SC/ .

* Under review at IEEE International Workshop on Machine Learning for Signal Processing (MLSP) 2024

View paper on

Share this with someone who'll enjoy it:

Title:Language-Oriented Semantic Latent Representation for Image Transmission

Paper and Code