Zhiqi Lin

Don't Forget Its Variance! The Minimum Path Variance Principle for Accurate and Stable Score-Based Density Ratio Estimation

Jan 31, 2026
Via arXiv

Virtual Width Networks

Nov 17, 2025
Via arXiv

Any-Step Density Ratio Estimation via Interval-Annealed Secant Alignment

Sep 05, 2025
Via arXiv

Truncated Proximal Policy Optimization

Jun 18, 2025
Via arXiv

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Apr 08, 2025
Via arXiv

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Mar 18, 2025
Via arXiv

Natural Language Fine-Tuning

Dec 29, 2024
Via arXiv

Fully Bayesian Differential Gaussian Processes through Stochastic Differential Equations

Aug 12, 2024
Via arXiv

Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling

Aug 07, 2024
Via arXiv

Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search

Nov 26, 2023
Via arXiv