Picture for Zhengpeng Xie

Zhengpeng Xie

The Meta-Representation Hypothesis

Add code
Jan 05, 2025
Viaarxiv icon

Simple Policy Optimization

Add code
Jan 29, 2024
Viaarxiv icon

Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods

Add code
Nov 03, 2023
Viaarxiv icon