Kaiwen Wang

Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection

Aug 18, 2024

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Jul 22, 2024

Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes

Mar 29, 2024

Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions

Mar 23, 2024

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

Mar 12, 2024

Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL

Mar 10, 2024

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

Feb 11, 2024

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

Jul 21, 2023

The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

May 25, 2023

Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

Feb 07, 2023