Yifan Yang

HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models

Mar 14, 2025
Via arXiv

Simulating Automotive Radar with Lidar and Camera Inputs

Mar 11, 2025
Via arXiv

StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition

Mar 08, 2025
Via arXiv

Wanda++: Pruning Large Language Models via Regional Gradients

Mar 06, 2025
Via arXiv

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

Mar 06, 2025
Via arXiv

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025
Via arXiv

RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision

Feb 19, 2025
Via arXiv

QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models

Feb 17, 2025
Via arXiv

VoLUT: Efficient Volumetric streaming enhanced by LUT-based super-resolution

Feb 17, 2025
Via arXiv

MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models

Feb 17, 2025
Via arXiv