Picture for Xuefeng Xiao

Xuefeng Xiao

OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training

Add code
Mar 31, 2025
Viaarxiv icon

Training-free Diffusion Acceleration with Bottleneck Sampling

Add code
Mar 27, 2025
Viaarxiv icon

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Add code
Mar 10, 2025
Viaarxiv icon

RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories

Add code
Mar 10, 2025
Viaarxiv icon

Training-free and Adaptive Sparse Attention for Efficient Long Video Generation

Add code
Feb 28, 2025
Viaarxiv icon

Diffusion Adversarial Post-Training for One-Step Video Generation

Add code
Jan 14, 2025
Viaarxiv icon

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Add code
Dec 19, 2024
Figure 1 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 2 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 3 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 4 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Viaarxiv icon

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Add code
Dec 19, 2024
Viaarxiv icon

Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training

Add code
Dec 02, 2024
Viaarxiv icon

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Figure 1 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 2 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 3 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 4 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Viaarxiv icon