Picture for Thanh Nguyen-Tang

Thanh Nguyen-Tang

Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms

Add code
Nov 01, 2024
Viaarxiv icon

Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks

Add code
Jul 16, 2024
Figure 1 for Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Figure 2 for Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Figure 3 for Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Figure 4 for Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks
Viaarxiv icon

Offline Multitask Representation Learning for Reinforcement Learning

Add code
Mar 18, 2024
Figure 1 for Offline Multitask Representation Learning for Reinforcement Learning
Viaarxiv icon

On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond

Add code
Jan 06, 2024
Viaarxiv icon

SigFormer: Signature Transformers for Deep Hedging

Add code
Oct 20, 2023
Viaarxiv icon

A Cosine Similarity-based Method for Out-of-Distribution Detection

Add code
Jun 23, 2023
Figure 1 for A Cosine Similarity-based Method for Out-of-Distribution Detection
Figure 2 for A Cosine Similarity-based Method for Out-of-Distribution Detection
Figure 3 for A Cosine Similarity-based Method for Out-of-Distribution Detection
Figure 4 for A Cosine Similarity-based Method for Out-of-Distribution Detection
Viaarxiv icon

VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation

Add code
Mar 04, 2023
Figure 1 for VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Figure 2 for VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Figure 3 for VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Figure 4 for VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Viaarxiv icon

On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation

Add code
Nov 23, 2022
Figure 1 for On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Figure 2 for On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Figure 3 for On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Figure 4 for On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation
Viaarxiv icon

Two-Stage Neural Contextual Bandits for Personalised News Recommendation

Add code
Jun 26, 2022
Figure 1 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Figure 2 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Figure 3 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Figure 4 for Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Viaarxiv icon

On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency

Add code
Mar 03, 2022
Figure 1 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Figure 2 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Figure 3 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Figure 4 for On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency
Viaarxiv icon