Picture for Yifan Yang

Yifan Yang

SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes

Add code
Jun 26, 2025
Viaarxiv icon

StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling

Add code
Jun 14, 2025
Viaarxiv icon

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Add code
Jun 06, 2025
Viaarxiv icon

Knowledge-guided Contextual Gene Set Analysis Using Large Language Models

Add code
Jun 04, 2025
Viaarxiv icon

ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL

Add code
May 30, 2025
Viaarxiv icon

FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression

Add code
May 29, 2025
Viaarxiv icon

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

Add code
May 26, 2025
Viaarxiv icon

$C^3$-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking

Add code
May 24, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning

Add code
May 21, 2025
Viaarxiv icon