Yifan Yang

EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents

May 11, 2026
Via arXiv

Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training

May 10, 2026
Via arXiv

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

May 10, 2026
Via arXiv

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling

May 07, 2026
Via arXiv

SOAR: Real-Time Joint Optimization of Order Allocation and Robot Scheduling in Robotic Mobile Fulfillment Systems

May 05, 2026
Via arXiv

Toward Multimodal Conversational AI for Age-Related Macular Degeneration

Apr 28, 2026
Via arXiv

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Apr 27, 2026
Via arXiv

MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation

Apr 16, 2026
Via arXiv

AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation

Apr 09, 2026
Via arXiv

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Apr 06, 2026
Via arXiv