Picture for Xin Geng

Xin Geng

Trustworthy Federated Label Distribution Learning under Annotation Quality Disparity

Add code
May 06, 2026
Viaarxiv icon

Meta-Aligner: Bidirectional Preference-Policy Optimization for Multi-Objective LLMs Alignment

Add code
Apr 27, 2026
Viaarxiv icon

Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization

Add code
Apr 16, 2026
Viaarxiv icon

MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation

Add code
Mar 28, 2026
Viaarxiv icon

A Creative Agent is Worth a 64-Token Template

Add code
Mar 18, 2026
Viaarxiv icon

VRM: Teaching Reward Models to Understand Authentic Human Preferences

Add code
Mar 05, 2026
Viaarxiv icon

Model Merging in the Essential Subspace

Add code
Feb 23, 2026
Viaarxiv icon

Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training

Add code
Feb 12, 2026
Viaarxiv icon

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Add code
Feb 02, 2026
Viaarxiv icon

Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models

Add code
Jan 28, 2026
Viaarxiv icon