Picture for Da Li

Da Li

Dynamic Expert Sharing: Decoupling Memory from Parallelism in Mixture-of-Experts Diffusion LLMs

Add code
Jan 31, 2026
Viaarxiv icon

Urban Neural Surface Reconstruction from Constrained Sparse Aerial Imagery with 3D SAR Fusion

Add code
Jan 29, 2026
Viaarxiv icon

MERGETUNE: Continued fine-tuning of vision-language models

Add code
Jan 16, 2026
Viaarxiv icon

OMG-Bench: A New Challenging Benchmark for Skeleton-based Online Micro Hand Gesture Recognition

Add code
Dec 18, 2025
Viaarxiv icon

Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language Navigation

Add code
Nov 18, 2025
Viaarxiv icon

One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow

Add code
Nov 17, 2025
Viaarxiv icon

Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding

Add code
Nov 11, 2025
Viaarxiv icon

SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models

Add code
Aug 10, 2025
Figure 1 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Figure 2 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Figure 3 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Figure 4 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Viaarxiv icon

HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models

Add code
Aug 06, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon