Yudong Chen

One-step full gradient suffices for low-rank fine-tuning, provably and efficiently

Feb 03, 2025
Via arXiv

Re-examining Double Descent and Scaling Laws under Norm-based Capacity via Deterministic Equivalence

Feb 03, 2025
Via arXiv

The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure

Oct 28, 2024
Via arXiv

Two-Timescale Linear Stochastic Approximation: Constant Stepsizes Go a Long Way

Oct 16, 2024
Via arXiv

The Plug-in Approach for Average-Reward and Discounted MDPs: Optimal Sample Complexity Analysis

Oct 10, 2024
Via arXiv

Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks

Oct 05, 2024
Via arXiv

Stable Offline Value Function Learning with Bisimulation-based Representations

Oct 02, 2024
Via arXiv

Entry-Specific Matrix Estimation under Arbitrary Sampling Patterns through the Lens of Network Flows

Sep 06, 2024
Via arXiv

Inception: Efficiently Computable Misinformation Attacks on Markov Games

Jun 24, 2024
Via arXiv

When is exponential asymptotic optimality achievable in average-reward restless bandits?

May 28, 2024
Via arXiv