Picture for Xin-Qiang Cai

Xin-Qiang Cai

Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning

Add code
Oct 26, 2024
Figure 1 for Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Figure 2 for Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Figure 3 for Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Figure 4 for Beyond Simple Sum of Delayed Rewards: Non-Markovian Reward Modeling for Reinforcement Learning
Viaarxiv icon

Soft-Label Integration for Robust Toxicity Classification

Add code
Oct 18, 2024
Viaarxiv icon

Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains

Add code
Apr 11, 2024
Viaarxiv icon

An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video

Add code
Apr 10, 2024
Figure 1 for An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video
Figure 2 for An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video
Figure 3 for An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video
Figure 4 for An Animation-based Augmentation Approach for Action Recognition from Discontinuous Video
Viaarxiv icon

Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution

Add code
Feb 06, 2024
Viaarxiv icon

Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding

Add code
Sep 16, 2023
Viaarxiv icon

Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations

Add code
Jun 17, 2021
Figure 1 for Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations
Figure 2 for Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations
Figure 3 for Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations
Figure 4 for Seeing Differently, Acting Similarly: Imitation Learning with Heterogeneous Observations
Viaarxiv icon

Expert-Level Atari Imitation Learning from Demonstrations Only

Add code
Sep 09, 2019
Figure 1 for Expert-Level Atari Imitation Learning from Demonstrations Only
Figure 2 for Expert-Level Atari Imitation Learning from Demonstrations Only
Figure 3 for Expert-Level Atari Imitation Learning from Demonstrations Only
Figure 4 for Expert-Level Atari Imitation Learning from Demonstrations Only
Viaarxiv icon