Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yin Sun

Multimodal Remote Inference

Aug 11, 2025

Keyuan Zhang, Yin Sun, Bo Ji

Abstract:We consider a remote inference system with multiple modalities, where a multimodal machine learning (ML) model performs real-time inference using features collected from remote sensors. As sensor observations may change dynamically over time, fresh features are critical for inference tasks. However, timely delivering features from all modalities is often infeasible due to limited network resources. To this end, we study a two-modality scheduling problem to minimize the ML model's inference error, which is expressed as a penalty function of AoI for both modalities. We develop an index-based threshold policy and prove its optimality. Specifically, the scheduler switches modalities when the current modality's index function exceeds a threshold. We show that the two modalities share the same threshold, and both the index functions and the threshold can be computed efficiently. The optimality of our policy holds for (i) general AoI functions that are \emph{non-monotonic} and \emph{non-additive} and (ii) \emph{heterogeneous} transmission times. Numerical results show that our policy reduces inference error by up to 55% compared to round-robin and uniform random policies, which are oblivious to the AoI-based inference error function. Our results shed light on how to improve remote inference accuracy by optimizing task-oriented AoI functions.

* Accepted by The 22nd IEEE International Conference on Mobile Ad-Hoc and Smart Systems (MASS 2025)

Via

Access Paper or Ask Questions

Send Pilot or Data? Leveraging Age of Channel State Information for Throughput Maximization

Mar 18, 2025

Sirin Chakraborty, Yin Sun

Figure 1 for Send Pilot or Data? Leveraging Age of Channel State Information for Throughput Maximization

Figure 2 for Send Pilot or Data? Leveraging Age of Channel State Information for Throughput Maximization

Figure 3 for Send Pilot or Data? Leveraging Age of Channel State Information for Throughput Maximization

Figure 4 for Send Pilot or Data? Leveraging Age of Channel State Information for Throughput Maximization

Abstract:In this paper, we study the optimal timing for pilot and data transmissions to maximize effective throughput, also known as goodput, over a wireless fading channel. The receiver utilizes the received pilot signal and its Age of Information (AoI), termed the Age of Channel State Information (AoCSI), to estimate the channel state. Based on this estimation, the transmitter selects an appropriate modulation and coding scheme (MCS) to maximize goodput while ensuring compliance with a predefined block error probability constraint. Furthermore, we design an optimal pilot scheduling policy that determines whether to transmit a pilot or data at each time step, with the objective of maximizing the long-term average goodput. This problem involves a non-monotonic AoI metric optimization challenge, as the goodput function is non-monotonic with respect to AoCSI. The numerical results illustrate the performance gains achieved by the proposed policy under various SNR levels and mobility speeds.

Via

Access Paper or Ask Questions

Timely Remote Estimation with Memory at the Receiver

Jan 03, 2025

Sirin Chakraborty, Yin Sun

Figure 1 for Timely Remote Estimation with Memory at the Receiver

Figure 2 for Timely Remote Estimation with Memory at the Receiver

Figure 3 for Timely Remote Estimation with Memory at the Receiver

Abstract:In this study, we consider a remote estimation system that estimates a time-varying target based on sensor data transmitted over wireless channel. Due to transmission errors, some data packets fail to reach the receiver. To mitigate this, the receiver uses a buffer to store recently received data packets, which allows for more accurate estimation from the incomplete received data. Our research focuses on optimizing the transmission scheduling policy to minimize the estimation error, which is quantified as a function of the age of information vector associated with the buffered packets. Our results show that maintaining a buffer at the receiver results in better estimation performance for non-Markovian sources.

* 5 pages, 3 figures, to be published in the Proceedings of the 2024 Asilomar Conference on Signals, Systems, and Computers, October 27-30, 2023, Pacific Grove, California, USA

Via

Access Paper or Ask Questions

Goal-Oriented Communications for Real-time Inference with Two-Way Delay

Oct 11, 2024

Cagri Ari, Md Kamran Chowdhury Shisher, Yin Sun, Elif Uysal

Figure 1 for Goal-Oriented Communications for Real-time Inference with Two-Way Delay

Figure 2 for Goal-Oriented Communications for Real-time Inference with Two-Way Delay

Figure 3 for Goal-Oriented Communications for Real-time Inference with Two-Way Delay

Figure 4 for Goal-Oriented Communications for Real-time Inference with Two-Way Delay

Abstract:We design a goal-oriented communication strategy for remote inference, where an intelligent model (e.g., a pre-trained neural network) at the receiver side predicts the real-time value of a target signal based on data packets transmitted from a remote location. The inference error depends on both the Age of Information (AoI) and the length of the data packets. Previous formulations of this problem either assumed IID transmission delays with immediate feedback or focused only on monotonic relations where inference performance degrades as the input data ages. In contrast, we consider a possibly non-monotonic relationship between the inference error and AoI. We show how to minimize the expected time-average inference error under two-way delay, where the delay process can have memory. Simulation results highlight the significant benefits of adopting such a goal-oriented communication strategy for remote inference, especially under highly variable delay scenarios.

* 12 pages, 8 figures

Via

Access Paper or Ask Questions

Timely Communications for Remote Inference

Apr 25, 2024

Md Kamran Chowdhury Shisher, Yin Sun, I-Hong Hou

Figure 1 for Timely Communications for Remote Inference

Figure 2 for Timely Communications for Remote Inference

Figure 3 for Timely Communications for Remote Inference

Figure 4 for Timely Communications for Remote Inference

Abstract:In this paper, we analyze the impact of data freshness on remote inference systems, where a pre-trained neural network infers a time-varying target (e.g., the locations of vehicles and pedestrians) based on features (e.g., video frames) observed at a sensing node (e.g., a camera). One might expect that the performance of a remote inference system degrades monotonically as the feature becomes stale. Using an information-theoretic analysis, we show that this is true if the feature and target data sequence can be closely approximated as a Markov chain, whereas it is not true if the data sequence is far from Markovian. Hence, the inference error is a function of Age of Information (AoI), where the function could be non-monotonic. To minimize the inference error in real-time, we propose a new "selection-from-buffer" model for sending the features, which is more general than the "generate-at-will" model used in earlier studies. In addition, we design low-complexity scheduling policies to improve inference performance. For single-source, single-channel systems, we provide an optimal scheduling policy. In multi-source, multi-channel systems, the scheduling problem becomes a multi-action restless multi-armed bandit problem. For this setting, we design a new scheduling policy by integrating Whittle index-based source selection and duality-based feature selection-from-buffer algorithms. This new scheduling policy is proven to be asymptotically optimal. These scheduling results hold for minimizing general AoI functions (monotonic or non-monotonic). Data-driven evaluations demonstrate the significant advantages of our proposed scheduling policies.

* arXiv admin note: text overlap with arXiv:2208.06948

Via

Access Paper or Ask Questions

On the Monotonicity of Information Aging

Mar 06, 2024

MD Kamran Chowdhury Shisher, Yin Sun

Figure 1 for On the Monotonicity of Information Aging

Figure 2 for On the Monotonicity of Information Aging

Abstract:In this paper, we analyze the monotonicity of information aging in a remote estimation system, where historical observations of a Gaussian autoregressive AR(p) process are used to predict its future values. We consider two widely used loss functions in estimation: (i) logarithmic loss function for maximum likelihood estimation and (ii) quadratic loss function for MMSE estimation. The estimation error of the AR(p) process is written as a generalized conditional entropy which has closed-form expressions. By using a new information-theoretic tool called $\epsilon$-Markov chain, we can evaluate the divergence of the AR(p) process from being a Markov chain. When the divergence $\epsilon$ is large, the estimation error of the AR(p) process can be far from a non-decreasing function of the Age of Information (AoI). Conversely, for small divergence $\epsilon$, the inference error is close to a non-decreasing AoI function. Each observation is a short sequence taken from the AR(p) process. As the observation sequence length increases, the parameter $\epsilon$ progressively reduces to zero, and hence the estimation error becomes a non-decreasing AoI function. These results underscore a connection between the monotonicity of information aging and the divergence of from being a Markov chain.

* Part of this work has been accepted by IEEE INFOCOM ASoI Workshop, 2024

Via

Access Paper or Ask Questions

Learning-augmented Online Minimization of Age of Information and Transmission Costs

Mar 05, 2024

Zhongdong Liu, Keyuan Zhang, Bin Li, Yin Sun, Y. Thomas Hou, Bo Ji

Figure 1 for Learning-augmented Online Minimization of Age of Information and Transmission Costs

Figure 2 for Learning-augmented Online Minimization of Age of Information and Transmission Costs

Figure 3 for Learning-augmented Online Minimization of Age of Information and Transmission Costs

Figure 4 for Learning-augmented Online Minimization of Age of Information and Transmission Costs

Abstract:We consider a discrete-time system where a resource-constrained source (e.g., a small sensor) transmits its time-sensitive data to a destination over a time-varying wireless channel. Each transmission incurs a fixed transmission cost (e.g., energy cost), and no transmission results in a staleness cost represented by the Age-of-Information. The source must balance the tradeoff between transmission and staleness costs. To address this challenge, we develop a robust online algorithm to minimize the sum of transmission and staleness costs, ensuring a worst-case performance guarantee. While online algorithms are robust, they are usually overly conservative and may have a poor average performance in typical scenarios. In contrast, by leveraging historical data and prediction models, machine learning (ML) algorithms perform well in average cases. However, they typically lack worst-case performance guarantees. To achieve the best of both worlds, we design a learning-augmented online algorithm that exhibits two desired properties: (i) consistency: closely approximating the optimal offline algorithm when the ML prediction is accurate and trusted; (ii) robustness: ensuring worst-case performance guarantee even ML predictions are inaccurate. Finally, we perform extensive simulations to show that our online algorithm performs well empirically and that our learning-augmented algorithm achieves both consistency and robustness.

* A preliminary version of this work is to be presented at IEEE INFOCOM 2024 Age and Semantics of Information Workshop

Via

Access Paper or Ask Questions

Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

Oct 04, 2023

Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

Figure 1 for Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

Figure 2 for Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

Figure 3 for Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

Abstract:This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM). The generalized concentrability condition establishes a framework that interpolates and extends the existing hypotheses of Markov chain Hoeffding-type inequalities. The flexibility of our framework allows Hoeffding's inequality to be applied beyond the ergodic Markov chains in the traditional sense. We demonstrate the utility by applying our framework to several non-asymptotic analyses arising from the field of machine learning, including (i) a generalization bound for empirical risk minimization with Markovian samples, (ii) a finite sample guarantee for Ployak-Ruppert averaging of SGD, and (iii) a new regret bound for rested Markovian bandits with general state space.

Via

Access Paper or Ask Questions

How Does Data Freshness Affect Real-time Supervised Learning?

Aug 15, 2022

Md Kamran Chowdhury Shisher, Yin Sun

Figure 1 for How Does Data Freshness Affect Real-time Supervised Learning?

Figure 2 for How Does Data Freshness Affect Real-time Supervised Learning?

Figure 3 for How Does Data Freshness Affect Real-time Supervised Learning?

Figure 4 for How Does Data Freshness Affect Real-time Supervised Learning?

Abstract:In this paper, we analyze the impact of data freshness on real-time supervised learning, where a neural network is trained to infer a time-varying target (e.g., the position of the vehicle in front) based on features (e.g., video frames) observed at a sensing node (e.g., camera or lidar). One might expect that the performance of real-time supervised learning degrades monotonically as the feature becomes stale. Using an information-theoretic analysis, we show that this is true if the feature and target data sequence can be closely approximated as a Markov chain; it is not true if the data sequence is far from Markovian. Hence, the prediction error of real-time supervised learning is a function of the Age of Information (AoI), where the function could be non-monotonic. Several experiments are conducted to illustrate the monotonic and non-monotonic behaviors of the prediction error. To minimize the inference error in real-time, we propose a new "selection-from-buffer" model for sending the features, which is more general than the "generate-at-will" model used in earlier studies. By using Gittins and Whittle indices, low-complexity scheduling strategies are developed to minimize the inference error, where a new connection between the Gittins index theory and Age of Information (AoI) minimization is discovered. These scheduling results hold (i) for minimizing general AoI functions (monotonic or non-monotonic) and (ii) for general feature transmission time distributions. Data-driven evaluations are presented to illustrate the benefits of the proposed scheduling algorithms.

* 20 Pages, 11 figures, Part of this work has been accepted by ACM MobiHoc, 2022

Via

Access Paper or Ask Questions

A Local Geometric Interpretation of Feature Extraction in Deep Feedforward Neural Networks

Feb 10, 2022

Md Kamran Chowdhury Shisher, Tasmeen Zaman Ornee, Yin Sun

Figure 1 for A Local Geometric Interpretation of Feature Extraction in Deep Feedforward Neural Networks

Abstract:In this paper, we present a local geometric analysis to interpret how deep feedforward neural networks extract low-dimensional features from high-dimensional data. Our study shows that, in a local geometric region, the optimal weight in one layer of the neural network and the optimal feature generated by the previous layer comprise a low-rank approximation of a matrix that is determined by the Bayes action of this layer. This result holds (i) for analyzing both the output layer and the hidden layers of the neural network, and (ii) for neuron activation functions with non-vanishing gradients. We use two supervised learning problems to illustrate our results: neural network based maximum likelihood classification (i.e., softmax regression) and neural network based minimum mean square estimation. Experimental validation of these theoretical results will be conducted in our future work.

Via

Access Paper or Ask Questions