Picture for Zaiwen Wen

Zaiwen Wen

Non-Asymptotic Global Convergence of PPO-Clip

Add code
Dec 18, 2025
Viaarxiv icon

Translating Informal Proofs into Formal Proofs Using a Chain of States

Add code
Dec 12, 2025
Viaarxiv icon

Advancing Mathematical Research via Human-AI Interactive Theorem Proving

Add code
Dec 11, 2025
Viaarxiv icon

SITA: A Framework for Structure-to-Instance Theorem Autoformalization

Add code
Nov 13, 2025
Viaarxiv icon

Accelerating Optimization via Differentiable Stopping Time

Add code
May 28, 2025
Viaarxiv icon

LMask: Learn to Solve Constrained Routing Problems with Lazy Masking

Add code
May 23, 2025
Viaarxiv icon

A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models

Add code
Feb 11, 2025
Viaarxiv icon

Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures

Add code
Oct 10, 2024
Figure 1 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Figure 2 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Figure 3 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Figure 4 for Enhancing Zeroth-order Fine-tuning for Language Models with Low-rank Structures
Viaarxiv icon

ODE-based Learning to Optimize

Add code
Jun 04, 2024
Figure 1 for ODE-based Learning to Optimize
Figure 2 for ODE-based Learning to Optimize
Figure 3 for ODE-based Learning to Optimize
Figure 4 for ODE-based Learning to Optimize
Viaarxiv icon

An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks

Add code
May 07, 2024
Viaarxiv icon