Picture for Renqing He

Renqing He

State Rank Dynamics in Linear Attention LLMs

Add code
Feb 02, 2026
Viaarxiv icon

From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning

Add code
Jan 30, 2026
Viaarxiv icon

LTS-VoiceAgent: A Listen-Think-Speak Framework for Efficient Streaming Voice Interaction via Semantic Triggering and Incremental Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

Efficient Paths and Dense Rewards: Probabilistic Flow Reasoning for Large Language Models

Add code
Jan 14, 2026
Viaarxiv icon

UserLM-R1: Modeling Human Reasoning in User Language Models with Multi-Reward Reinforcement Learning

Add code
Jan 14, 2026
Viaarxiv icon

Long-term Task-oriented Agent: Proactive Long-term Intent Maintenance in Dynamic Environments

Add code
Jan 14, 2026
Viaarxiv icon

GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR

Add code
Jan 14, 2026
Viaarxiv icon

Fine-Mem: Fine-Grained Feedback Alignment for Long-Horizon Memory Management

Add code
Jan 13, 2026
Viaarxiv icon

Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering

Add code
Jan 13, 2026
Viaarxiv icon

Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments

Add code
Jun 20, 2024
Figure 1 for Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments
Figure 2 for Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments
Figure 3 for Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments
Figure 4 for Harvesting Efficient On-Demand Order Pooling from Skilled Couriers: Enhancing Graph Representation Learning for Refining Real-time Many-to-One Assignments
Viaarxiv icon