Picture for Dandan Zheng

Dandan Zheng

PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning

Add code
Oct 22, 2025
Viaarxiv icon

Ming-Omni: A Unified Multimodal Model for Perception and Generation

Add code
Jun 11, 2025
Viaarxiv icon

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction

Add code
May 05, 2025
Viaarxiv icon

A Continual Learning-driven Model for Accurate and Generalizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT

Add code
Mar 16, 2025
Viaarxiv icon

From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans

Add code
Mar 11, 2025
Figure 1 for From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans
Figure 2 for From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans
Figure 3 for From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans
Figure 4 for From Slices to Sequences: Autoregressive Tracking Transformer for Cohesive and Consistent 3D Lymph Node Detection in CT Scans
Viaarxiv icon

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Add code
Dec 08, 2024
Figure 1 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Figure 2 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Figure 3 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Figure 4 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Viaarxiv icon

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Add code
Dec 04, 2024
Figure 1 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 2 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 3 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 4 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Viaarxiv icon

LumiSculpt: A Consistency Lighting Control Network for Video Generation

Add code
Oct 30, 2024
Figure 1 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 2 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 3 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 4 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Viaarxiv icon

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Add code
Oct 14, 2024
Figure 1 for Animate-X: Universal Character Image Animation with Enhanced Motion Representation
Figure 2 for Animate-X: Universal Character Image Animation with Enhanced Motion Representation
Figure 3 for Animate-X: Universal Character Image Animation with Enhanced Motion Representation
Figure 4 for Animate-X: Universal Character Image Animation with Enhanced Motion Representation
Viaarxiv icon

3DGAUnet: 3D generative adversarial networks with a 3D U-Net based generator to achieve the accurate and effective synthesis of clinical tumor image data for pancreatic cancer

Add code
Nov 27, 2023
Viaarxiv icon