Youbang Sun

Finite-Time Analysis of Stochastic Nonconvex Nonsmooth Optimization on the Riemannian Manifolds

Oct 24, 2025
Via arXiv

FlowRL: Matching Reward Distributions for LLM Reasoning

Sep 18, 2025
Via arXiv

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Sep 11, 2025
Via arXiv

A Survey of Reinforcement Learning for Large Reasoning Models

Sep 10, 2025
Via arXiv

Towards a Unified View of Large Language Model Post-Training

Sep 04, 2025
Via arXiv

Automating Exploratory Multiomics Research via Language Models

Jun 09, 2025
Via arXiv

TTRL: Test-Time Reinforcement Learning

Apr 22, 2025
Via arXiv

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Mar 14, 2025
Via arXiv

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Dec 23, 2024
Via arXiv

Local Linear Convergence of Infeasible Optimization with Orthogonal Constraints

Dec 07, 2024
Via arXiv