Picture for Xiang Wang

Xiang Wang

Victor

Taming Consistency Distillation for Accelerated Human Image Animation

Add code
Apr 15, 2025
Viaarxiv icon

DMPT: Decoupled Modality-aware Prompt Tuning for Multi-modal Object Re-identification

Add code
Apr 15, 2025
Viaarxiv icon

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Add code
Apr 15, 2025
Viaarxiv icon

SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models

Add code
Apr 09, 2025
Viaarxiv icon

Exploring the Evolution of Physics Cognition in Video Generation: A Survey

Add code
Mar 27, 2025
Viaarxiv icon

Towards Unified Latent Space for 3D Molecular Latent Diffusion Modeling

Add code
Mar 19, 2025
Viaarxiv icon

Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation

Add code
Mar 12, 2025
Viaarxiv icon

Route Sparse Autoencoder to Interpret Large Language Models

Add code
Mar 11, 2025
Viaarxiv icon

ACE: Concept Editing in Diffusion Models without Performance Degradation

Add code
Mar 11, 2025
Viaarxiv icon

DreamRelation: Relation-Centric Video Customization

Add code
Mar 10, 2025
Viaarxiv icon