Yan Sun

Talk Less, Verify More: Improving LLM Assistants with Semantic Checks and Execution Feedback

Jan 07, 2026
Via arXiv

MultiRisk: Multiple Risk Control via Iterative Score Thresholding

Dec 31, 2025
Via arXiv

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Dec 30, 2025
Via arXiv

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Dec 23, 2025
Via arXiv

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Oct 21, 2025
Via arXiv

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

Sep 26, 2025
Via arXiv

MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models

Jun 15, 2025
Via arXiv

Investigating the Effects of Cognitive Biases in Prompts on Large Language Model Outputs

Jun 14, 2025
Via arXiv

Foundations of Top-$k$ Decoding For Language Models

May 25, 2025
Via arXiv

Time Tracker: Mixture-of-Experts-Enhanced Foundation Time Series Forecasting Model with Decoupled Training Pipelines

May 21, 2025
Via arXiv