Picture for Hongming Shan

Hongming Shan

Towards Interpretable Counterfactual Generation via Multimodal Autoregression

Add code
Mar 29, 2025
Viaarxiv icon

Shushing! Let's Imagine an Authentic Speech from the Silent Video

Add code
Mar 19, 2025
Viaarxiv icon

DreamRelation: Relation-Centric Video Customization

Add code
Mar 10, 2025
Viaarxiv icon

Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model

Add code
Mar 02, 2025
Viaarxiv icon

Autoregressive Medical Image Segmentation via Next-Scale Mask Prediction

Add code
Feb 28, 2025
Viaarxiv icon

Emotional Face-to-Speech

Add code
Feb 03, 2025
Figure 1 for Emotional Face-to-Speech
Figure 2 for Emotional Face-to-Speech
Figure 3 for Emotional Face-to-Speech
Figure 4 for Emotional Face-to-Speech
Viaarxiv icon

Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction

Add code
Jan 26, 2025
Figure 1 for Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction
Figure 2 for Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction
Figure 3 for Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction
Figure 4 for Radiologist-in-the-Loop Self-Training for Generalizable CT Metal Artifact Reduction
Viaarxiv icon

Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning

Add code
Jan 07, 2025
Viaarxiv icon

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Add code
Oct 17, 2024
Figure 1 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Figure 2 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Figure 3 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Figure 4 for DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
Viaarxiv icon

AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation

Add code
Oct 08, 2024
Viaarxiv icon