Picture for Yudong Chen

Yudong Chen

The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure

Add code
Oct 28, 2024
Viaarxiv icon

Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Add code
Oct 16, 2024
Viaarxiv icon

The Plug-in Approach for Average-Reward and Discounted MDPs: Optimal Sample Complexity Analysis

Add code
Oct 10, 2024
Viaarxiv icon

Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks

Add code
Oct 05, 2024
Figure 1 for Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks
Figure 2 for Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks
Figure 3 for Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks
Figure 4 for Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks
Viaarxiv icon

Stable Offline Value Function Learning with Bisimulation-based Representations

Add code
Oct 02, 2024
Viaarxiv icon

Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows

Add code
Sep 06, 2024
Figure 1 for Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows
Figure 2 for Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows
Figure 3 for Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows
Figure 4 for Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows
Viaarxiv icon

Inception: Efficiently Computable Misinformation Attacks on Markov Games

Add code
Jun 24, 2024
Figure 1 for Inception: Efficiently Computable Misinformation Attacks on Markov Games
Figure 2 for Inception: Efficiently Computable Misinformation Attacks on Markov Games
Viaarxiv icon

When is exponential asymptotic optimality achievable in average-reward restless bandits?

Add code
May 28, 2024
Viaarxiv icon

The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize

Add code
May 27, 2024
Viaarxiv icon

Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA

Add code
Apr 09, 2024
Viaarxiv icon