Zhepeng Cen

Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens

Oct 18, 2024
Via arXiv

OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning

Jul 19, 2024
Via arXiv

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

May 20, 2024
Via arXiv

Learning from Sparse Offline Datasets via Conservative Density Estimation

Jan 16, 2024
Via arXiv

Gradient Shaping for Multi-Constraint Safe Reinforcement Learning

Dec 23, 2023
Via arXiv

Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning

Oct 05, 2023
Via arXiv

Datasets and Benchmarks for Offline Safe Reinforcement Learning

Jun 16, 2023
Via arXiv

Constrained Decision Transformer for Offline Safe Reinforcement Learning

Feb 14, 2023
Via arXiv

Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability

Sep 16, 2022
Via arXiv

On the Robustness of Safe Reinforcement Learning under Observational Perturbations

May 29, 2022
Via arXiv