Picture for Lei Song

Lei Song

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Add code
Dec 26, 2025
Viaarxiv icon

SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents

Add code
Dec 08, 2025
Viaarxiv icon

Holdout-Loss-Based Data Selection for LLM Finetuning via In-Context Learning

Add code
Oct 16, 2025
Viaarxiv icon

Rethinking Reward Models for Multi-Domain Test-Time Scaling

Add code
Oct 02, 2025
Viaarxiv icon

Sample-efficient LLM Optimization with Reset Replay

Add code
Aug 08, 2025
Figure 1 for Sample-efficient LLM Optimization with Reset Replay
Figure 2 for Sample-efficient LLM Optimization with Reset Replay
Figure 3 for Sample-efficient LLM Optimization with Reset Replay
Figure 4 for Sample-efficient LLM Optimization with Reset Replay
Viaarxiv icon

HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

Add code
Jun 18, 2025
Viaarxiv icon

Learning to Select In-Context Demonstration Preferred by Large Language Model

Add code
May 26, 2025
Viaarxiv icon

Instance-Prototype Affinity Learning for Non-Exemplar Continual Graph Learning

Add code
May 15, 2025
Viaarxiv icon

OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval

Add code
May 10, 2025
Figure 1 for OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
Figure 2 for OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
Figure 3 for OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
Figure 4 for OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
Viaarxiv icon

Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions

Add code
May 01, 2025
Figure 1 for Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Figure 2 for Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Figure 3 for Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Figure 4 for Multi-Constraint Safe Reinforcement Learning via Closed-form Solution for Log-Sum-Exp Approximation of Control Barrier Functions
Viaarxiv icon