Picture for Zhiqi Lin

Zhiqi Lin

Any-Step Density Ratio Estimation via Interval-Annealed Secant Alignment

Add code
Sep 05, 2025
Viaarxiv icon

Truncated Proximal Policy Optimization

Add code
Jun 18, 2025
Viaarxiv icon

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Add code
Apr 08, 2025
Figure 1 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Figure 2 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Figure 3 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Viaarxiv icon

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Add code
Mar 18, 2025
Viaarxiv icon

Natural Language Fine-Tuning

Add code
Dec 29, 2024
Viaarxiv icon

Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations

Add code
Aug 12, 2024
Figure 1 for Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations
Figure 2 for Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations
Figure 3 for Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations
Figure 4 for Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations
Viaarxiv icon

Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling

Add code
Aug 07, 2024
Figure 1 for Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling
Figure 2 for Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling
Figure 3 for Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling
Figure 4 for Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling
Viaarxiv icon

Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search

Add code
Nov 26, 2023
Viaarxiv icon

SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction

Add code
Jan 21, 2023
Figure 1 for SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Figure 2 for SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Figure 3 for SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Figure 4 for SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction
Viaarxiv icon