Picture for Kaiwen Wang

Kaiwen Wang

Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection

Add code
Aug 18, 2024
Viaarxiv icon

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Add code
Jul 22, 2024
Figure 1 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 2 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 3 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Figure 4 for Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Viaarxiv icon

Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes

Add code
Mar 29, 2024
Viaarxiv icon

Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions

Add code
Mar 23, 2024
Viaarxiv icon

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

Add code
Mar 12, 2024
Figure 1 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 2 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 3 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 4 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Viaarxiv icon

Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL

Add code
Mar 10, 2024
Viaarxiv icon

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

Add code
Feb 11, 2024
Viaarxiv icon

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

Add code
Jul 21, 2023
Viaarxiv icon

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

Add code
May 25, 2023
Figure 1 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Figure 2 for The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Viaarxiv icon

Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

Add code
Feb 07, 2023
Viaarxiv icon