Picture for Yibo Yang

Yibo Yang

Self-Guided Process Reward Optimization with Redefined Step-wise Advantage for Process Reinforcement Learning

Add code
Jul 03, 2025
Viaarxiv icon

Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence

Add code
Jun 16, 2025
Viaarxiv icon

AstroCompress: A benchmark dataset for multi-purpose compression of astronomical data

Add code
Jun 10, 2025
Viaarxiv icon

Optimization-Inspired Few-Shot Adaptation for Large Language Models

Add code
May 25, 2025
Viaarxiv icon

Inference Compute-Optimal Video Vision Language Models

Add code
May 24, 2025
Viaarxiv icon

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Add code
Mar 26, 2025
Viaarxiv icon

Optimizing Singular Spectrum for Large Language Model Compression

Add code
Feb 20, 2025
Viaarxiv icon

Enhancing Generalization via Sharpness-Aware Trajectory Matching for Dataset Condensation

Add code
Feb 03, 2025
Viaarxiv icon

Continuous Knowledge-Preserving Decomposition for Few-Shot Continual Learning

Add code
Jan 09, 2025
Viaarxiv icon

Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization

Add code
Dec 24, 2024
Viaarxiv icon