Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Suchinthaka Wanninayaka

University of California at Davis, USA

TACO: Rethinking Semantic Communications with Task Adaptation and Context Embedding

May 16, 2025

Achintha Wijesinghe, Weiwei Wang, Suchinthaka Wanninayaka, Songyang Zhang, Zhi Ding

Figure 1 for TACO: Rethinking Semantic Communications with Task Adaptation and Context Embedding

Figure 2 for TACO: Rethinking Semantic Communications with Task Adaptation and Context Embedding

Figure 3 for TACO: Rethinking Semantic Communications with Task Adaptation and Context Embedding

Figure 4 for TACO: Rethinking Semantic Communications with Task Adaptation and Context Embedding

Abstract:Recent advancements in generative artificial intelligence have introduced groundbreaking approaches to innovating next-generation semantic communication, which prioritizes conveying the meaning of a message rather than merely transmitting raw data. A fundamental challenge in semantic communication lies in accurately identifying and extracting the most critical semantic information while adapting to downstream tasks without degrading performance, particularly when the objective at the receiver may evolve over time. To enable flexible adaptation to multiple tasks at the receiver, this work introduces a novel semantic communication framework, which is capable of jointly capturing task-specific information to enhance downstream task performance and contextual information. Through rigorous experiments on popular image datasets and computer vision tasks, our framework shows promising improvement compared to existing work, including superior performance in downstream tasks, better generalizability, ultra-high bandwidth efficiency, and low reconstruction latency.

* Submitted to the IEEE GlobeCom 2025

Via

Access Paper or Ask Questions

Task-Driven Semantic Quantization and Imitation Learning for Goal-Oriented Communications

Feb 25, 2025

Yu-Chieh Chao, Yubei Chen, Weiwei Wang, Achintha Wijesinghe, Suchinthaka Wanninayaka, Songyang Zhang, Zhi Ding

Figure 1 for Task-Driven Semantic Quantization and Imitation Learning for Goal-Oriented Communications

Figure 2 for Task-Driven Semantic Quantization and Imitation Learning for Goal-Oriented Communications

Figure 3 for Task-Driven Semantic Quantization and Imitation Learning for Goal-Oriented Communications

Figure 4 for Task-Driven Semantic Quantization and Imitation Learning for Goal-Oriented Communications

Abstract:Semantic communication marks a new paradigm shift from bit-wise data transmission to semantic information delivery for the purpose of bandwidth reduction. To more effectively carry out specialized downstream tasks at the receiver end, it is crucial to define the most critical semantic message in the data based on the task or goal-oriented features. In this work, we propose a novel goal-oriented communication (GO-COM) framework, namely Goal-Oriented Semantic Variational Autoencoder (GOS-VAE), by focusing on the extraction of the semantics vital to the downstream tasks. Specifically, we adopt a Vector Quantized Variational Autoencoder (VQ-VAE) to compress media data at the transmitter side. Instead of targeting the pixel-wise image data reconstruction, we measure the quality-of-service at the receiver end based on a pre-defined task-incentivized model. Moreover, to capture the relevant semantic features in the data reconstruction, imitation learning is adopted to measure the data regeneration quality in terms of goal-oriented semantics. Our experimental results demonstrate the power of imitation learning in characterizing goal-oriented semantics and bandwidth efficiency of our proposed GOS-VAE.

* Accepted for publication in 2025 International Conference on Communications (IEEE ICC); 6 pages, 4 figures

Via

Access Paper or Ask Questions

LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency

Dec 18, 2024

Achintha Wijesinghe, Suchinthaka Wanninayaka, Weiwei Wang, Yu-Chieh Chao, Songyang Zhang, Zhi Ding

Figure 1 for LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency

Figure 2 for LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency

Figure 3 for LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency

Figure 4 for LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency

Abstract:The recent rise of semantic-style communications includes the development of goal-oriented communications (GOCOMs) remarkably efficient multimedia information transmissions. The concept of GO-COMS leverages advanced artificial intelligence (AI) tools to address the rising demand for bandwidth efficiency in applications, such as edge computing and Internet-of-Things (IoT). Unlike traditional communication systems focusing on source data accuracy, GO-COMs provide intelligent message delivery catering to the special needs critical to accomplishing downstream tasks at the receiver. In this work, we present a novel GO-COM framework, namely LaMI-GO that utilizes emerging generative AI for better quality-of-service (QoS) with ultra-high communication efficiency. Specifically, we design our LaMI-GO system backbone based on a latent diffusion model followed by a vector-quantized generative adversarial network (VQGAN) for efficient latent embedding and information representation. The system trains a common feature codebook the receiver side. Our experimental results demonstrate substantial improvement in perceptual quality, accuracy of downstream tasks, and bandwidth consumption over the state-of-the-art GOCOM systems and establish the power of our proposed LaMI-GO communication framework.

* Under review

Via

Access Paper or Ask Questions

Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications

Dec 09, 2024

Suchinthaka Wanninayaka, Achintha Wijesinghe, Weiwei Wang, Yu-Chieh Chao, Songyang Zhang, Zhi Ding

$Figure 1 for Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications$

$Figure 2 for Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications$

$Figure 3 for Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications$

$Figure 4 for Diff-GO$^\text{n}$: Enhancing Diffusion Models for Goal-Oriented Communications$

Abstract:The rapid expansion of edge devices and Internet-of-Things (IoT) continues to heighten the demand for data transport under limited spectrum resources. The goal-oriented communications (GO-COM), unlike traditional communication systems designed for bit-level accuracy, prioritizes more critical information for specific application goals at the receiver. To improve the efficiency of generative learning models for GO-COM, this work introduces a novel noise-restricted diffusion-based GO-COM (Diff-GO$^\text{n}$) framework for reducing bandwidth overhead while preserving the media quality at the receiver. Specifically, we propose an innovative Noise-Restricted Forward Diffusion (NR-FD) framework to accelerate model training and reduce the computation burden for diffusion-based GO-COMs by leveraging a pre-sampled pseudo-random noise bank (NB). Moreover, we design an early stopping criterion for improving computational efficiency and convergence speed, allowing high-quality generation in fewer training steps. Our experimental results demonstrate superior perceptual quality of data transmission at a reduced bandwidth usage and lower computation, making Diff-GO$^\text{n}$ well-suited for real-time communications and downstream applications.

Via

Access Paper or Ask Questions