Thanh Vinh Vo

Knowledge Sharing and Transfer via Centralized Reward Agent for Multi-Task Reinforcement Learning

Aug 20, 2024

Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong

Abstract: Reward shaping is effective in addressing the sparse-reward challenge in reinforcement learning by providing immediate feedback through auxiliary informative rewards. Based on the reward shaping strategy, we propose a novel multi-task reinforcement learning framework that integrates a centralized reward agent (CRA) and multiple distributed policy agents. The CRA functions as a knowledge pool, which aims to distill knowledge from various tasks and distribute it to individual policy agents to improve learning efficiency. Specifically, the shaped rewards serve as a straightforward metric to encode knowledge. This framework not only enhances knowledge sharing across established tasks but also adapts to new tasks by transferring valuable reward signals. We validate the proposed method on both discrete and continuous domains, demonstrating its robustness in multi-task sparse-reward settings and its effective transferability to unseen tasks.

Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning

Aug 07, 2024

Haozhe Ma, Zhengding Luo, Thanh Vinh Vo, Kuankuan Sima, Tze-Yun Leong

Abstract: Reward shaping addresses the challenge of sparse rewards in reinforcement learning by constructing denser and more informative reward signals. To achieve self-adaptive and highly efficient reward shaping, we propose a novel method that incorporates success rates derived from historical experiences into shaped rewards. Our approach utilizes success rates sampled from Beta distributions, which dynamically evolve from uncertain to reliable values as more data is collected. Initially, the self-adaptive success rates exhibit more randomness to encourage exploration. Over time, they become more certain to enhance exploitation, thus achieving a better balance between exploration and exploitation. We employ Kernel Density Estimation (KDE) combined with Random Fourier Features (RFF) to derive the Beta distributions, resulting in a computationally efficient implementation in high-dimensional continuous state spaces. This method provides a non-parametric and learning-free approach. The proposed method is evaluated on a wide range of continuous control tasks with sparse and delayed rewards, demonstrating significant improvements in sample efficiency and convergence stability compared to relevant baselines.
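
The idea that Beta-sampled success rates move from uncertain to reliable as data accumulates can be illustrated with a small sketch. This is not the authors' implementation: it assumes plain integer success and failure counts with a uniform Beta(1, 1) prior, whereas the abstract describes deriving the Beta distributions via KDE and RFF. The function names `sample_success_rate` and `spread` are hypothetical.

```python
import random


def sample_success_rate(successes, failures, rng):
    """Draw one success rate from Beta(1 + successes, 1 + failures).

    Assumption: a uniform Beta(1, 1) prior and integer counts, used here
    only as a stand-in for the density-based estimates in the abstract.
    """
    return rng.betavariate(1.0 + successes, 1.0 + failures)


def spread(successes, failures, n_samples=2000, seed=0):
    """Return (mean, standard deviation) of repeated draws for given counts."""
    rng = random.Random(seed)
    samples = [sample_success_rate(successes, failures, rng)
               for _ in range(n_samples)]
    mean = sum(samples) / n_samples
    variance = sum((s - mean) ** 2 for s in samples) / n_samples
    return mean, variance ** 0.5


if __name__ == "__main__":
    # Few observations: draws vary widely, which encourages exploration.
    print("early:", spread(1, 1))
    # Many observations with the same ratio: draws concentrate near 0.5.
    print("late: ", spread(100, 100))
```

With the same success ratio, the draws based on more observations cluster more tightly around the empirical rate, which is the exploration-to-exploitation shift the abstract describes.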

Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

Jul 20, 2024

Di Fu, Thanh Vinh Vo, Haozhe Ma, Tze-Yun Leong

Abstract: Action recognition technology plays a vital role in enhancing security through surveillance systems, enabling better patient monitoring in healthcare, providing in-depth performance analysis in sports, and facilitating seamless human-AI collaboration in domains such as manufacturing and assistive technologies. The dynamic nature of data in these areas underscores the need for models that can continuously adapt to new video data without losing previously acquired knowledge, highlighting the critical role of advanced continual action recognition. To address these challenges, we propose Decoupled Prompt-Adapter Tuning (DPAT), a novel framework that integrates adapters for capturing spatial-temporal information and learnable prompts for mitigating catastrophic forgetting through a decoupled training strategy. DPAT uniquely balances the generalization benefits of prompt tuning with the plasticity provided by adapters in pretrained vision models, effectively addressing the challenge of maintaining model performance amidst continuous data evolution without necessitating extensive finetuning. DPAT consistently achieves state-of-the-art performance across several challenging action recognition benchmarks, thus demonstrating the effectiveness of our model in the domain of continual action recognition.

Federated Learning of Causal Effects from Incomplete Observational Data

Aug 24, 2023

Thanh Vinh Vo, Young Lee, Tze-Yun Leong

Abstract: Decentralized and incomplete data sources are prevalent in real-world applications, posing a formidable challenge for causal inference. These sources cannot be consolidated into a single entity owing to privacy constraints, and the presence of missing values within them can potentially introduce bias to the causal estimands. We introduce a new approach for federated causal inference from incomplete data, enabling the estimation of causal effects from multiple decentralized and incomplete data sources. Our approach disentangles the loss function into multiple components, each corresponding to a specific data source with missing values. Our approach accounts for the missing data under the missing at random assumption, while also estimating higher-order statistics of the causal estimands. Our method recovers the conditional distribution of missing confounders given the observed confounders from the decentralized data sources to identify causal effects. Our framework estimates heterogeneous causal effects without the sharing of raw training data among sources, which helps to mitigate privacy risks. The efficacy of our approach is demonstrated through a collection of simulated and real-world instances, illustrating its potential and practicality.

* Preprint

An Adaptive Kernel Approach to Federated Learning of Heterogeneous Causal Effects

Jan 01, 2023

Thanh Vinh Vo, Arnab Bhattacharyya, Young Lee, Tze-Yun Leong

Abstract: We propose a new causal inference framework to learn causal effects from multiple, decentralized data sources in a federated setting. We introduce an adaptive transfer algorithm that learns the similarities among the data sources by utilizing Random Fourier Features to disentangle the loss function into multiple components, each of which is associated with a data source. The data sources may have different distributions; the causal effects are independently and systematically incorporated. The proposed method estimates the similarities among the sources through transfer coefficients, and hence requires no prior information about the similarity measures. The heterogeneous causal effects can be estimated with no sharing of the raw training data among the sources, thus minimizing the risk of privacy leakage. We also provide minimax lower bounds to assess the quality of the parameters learned from the disparate sources. The proposed method is empirically shown to outperform the baselines on decentralized data sources with dissimilar distributions.
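
For readers unfamiliar with Random Fourier Features, the following minimal sketch shows the standard construction that approximates a Gaussian (RBF) kernel with an explicit finite-dimensional feature map. It illustrates only the generic technique, not the paper's adaptive transfer algorithm or its loss decomposition. The names `make_rff`, `rbf_kernel`, and the `lengthscale` parameter are assumptions made for illustration.

```python
import math
import random


def make_rff(dim, n_features, lengthscale, seed=0):
    """Build a random feature map z(x) with z(x) . z(y) ~ k(x, y).

    Assumption: k is the Gaussian kernel exp(-||x - y||^2 / (2 * lengthscale^2)),
    so frequencies are drawn from N(0, 1 / lengthscale^2) and phase offsets
    from Uniform[0, 2*pi].
    """
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 1.0 / lengthscale) for _ in range(dim)]
               for _ in range(n_features)]
    offsets = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
    scale = math.sqrt(2.0 / n_features)

    def features(x):
        # One cosine feature per sampled (frequency, offset) pair.
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(weights, offsets)]

    return features


def rbf_kernel(x, y, lengthscale):
    """Exact Gaussian kernel value, for comparison with the approximation."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-squared_distance / (2.0 * lengthscale ** 2))


if __name__ == "__main__":
    features = make_rff(dim=2, n_features=4000, lengthscale=1.0)
    x, y = [0.3, -0.2], [-0.1, 0.5]
    approx = sum(a * b for a, b in zip(features(x), features(y)))
    print("approximate:", approx, "exact:", rbf_kernel(x, y, 1.0))
```

Because the kernel becomes an inner product of explicit features, quantities that depend on it can be written as sums over per-source terms, which is the general property that makes such features convenient in decentralized settings.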

* NeurIPS 2022

Adaptive Multi-Source Causal Inference

May 31, 2021

Thanh Vinh Vo, Pengfei Wei, Trong Nghia Hoang, Tze-Yun Leong

Figures 1 to 4 for Adaptive Multi-Source Causal Inference

Abstract: Data scarcity is a tremendous challenge in causal effect estimation. In this paper, we propose to exploit additional data sources to facilitate estimating causal effects in the target population. Specifically, we leverage additional source datasets which share similar causal mechanisms with the target observations to help infer causal effects of the target population. We propose three levels of knowledge transfer, through modelling the outcomes, treatments, and confounders. To achieve consistent positive transfer, we introduce learnable parametric transfer factors to adaptively control the transfer strength, thus achieving a fair and balanced knowledge transfer between the sources and the target. The proposed method can infer causal effects in the target population without prior knowledge of data discrepancy between the additional data sources and the target. Experiments on both synthetic and real-world datasets show the effectiveness of the proposed method as compared with recent baselines.

* Preprint

Federated Estimation of Causal Effects from Observational Data

May 31, 2021

Thanh Vinh Vo, Trong Nghia Hoang, Young Lee, Tze-Yun Leong

Figures 1 to 4 for Federated Estimation of Causal Effects from Observational Data

Abstract: Many modern applications collect data in a federated fashion, with data kept locally and undisclosed. To date, most approaches to causal inference require data to be stored in a central repository. We present a novel framework for causal inference with federated data sources. We assess and integrate local causal effects from different private data sources without centralizing them. We then estimate the treatment effects on subjects from observational data using a non-parametric reformulation of the classical potential outcomes framework. We model the potential outcomes as a random function distributed according to Gaussian processes, whose defining parameters can be efficiently learned from multiple data sources while respecting privacy constraints. We demonstrate the promise and efficiency of the proposed approach through a set of simulated and real-world benchmark examples.

* Preprint

A Causal Modeling Framework with Stochastic Confounders

Apr 27, 2020

Thanh Vinh Vo, Pengfei Wei, Wicher Bergsma, Tze-Yun Leong

Figures 1 to 4 for A Causal Modeling Framework with Stochastic Confounders

Abstract: This work aims to extend the current causal inference framework to incorporate stochastic confounders by exploiting the Markov property. We further develop a robust and simple algorithm for accurately estimating the causal effects based on the observed outcomes, treatments, and covariates, without any parametric specification of the components and their relations. This is in contrast to the state-of-the-art approaches that involve careful parameterization of deep neural networks for causal inference. We show that the proposed algorithm, far from being a triviality, has profound significance for temporal data in both a qualitative and quantitative sense.

* Preprint, work in progress
