Picture for Zhang-Wei Hong

Zhang-Wei Hong

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Add code
Feb 04, 2025
Viaarxiv icon

Embodied Red Teaming for Auditing Robotic Foundation Models

Add code
Nov 27, 2024
Viaarxiv icon

ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning

Add code
Oct 28, 2024
Figure 1 for ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning
Figure 2 for ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning
Figure 3 for ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning
Figure 4 for ImageNet-RIB Benchmark: Large Pre-Training Datasets Don't Guarantee Robustness after Fine-Tuning
Viaarxiv icon

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Add code
Oct 17, 2024
Viaarxiv icon

Random Latent Exploration for Deep Reinforcement Learning

Add code
Jul 18, 2024
Figure 1 for Random Latent Exploration for Deep Reinforcement Learning
Figure 2 for Random Latent Exploration for Deep Reinforcement Learning
Figure 3 for Random Latent Exploration for Deep Reinforcement Learning
Figure 4 for Random Latent Exploration for Deep Reinforcement Learning
Viaarxiv icon

ROER: Regularized Optimal Experience Replay

Add code
Jul 04, 2024
Figure 1 for ROER: Regularized Optimal Experience Replay
Figure 2 for ROER: Regularized Optimal Experience Replay
Figure 3 for ROER: Regularized Optimal Experience Replay
Figure 4 for ROER: Regularized Optimal Experience Replay
Viaarxiv icon

Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models

Add code
Jun 06, 2024
Viaarxiv icon

Curiosity-driven Red-teaming for Large Language Models

Add code
Feb 29, 2024
Viaarxiv icon

Neuro-Inspired Fragmentation and Recall to Overcome Catastrophic Forgetting in Curiosity

Add code
Oct 26, 2023
Viaarxiv icon

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Add code
Oct 12, 2023
Viaarxiv icon