Picture for Wenhan Luo

Wenhan Luo

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

Add code
Mar 13, 2025
Viaarxiv icon

FedDyMem: Efficient Federated Learning with Dynamic Memory and Memory-Reduce for Unsupervised Image Anomaly Detection

Add code
Feb 28, 2025
Viaarxiv icon

Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents

Add code
Feb 27, 2025
Figure 1 for Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents
Figure 2 for Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents
Figure 3 for Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents
Figure 4 for Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents
Viaarxiv icon

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer

Add code
Feb 09, 2025
Viaarxiv icon

MB-TaylorFormer V2: Improved Multi-branch Linear Transformer Expanded by Taylor Formula for Image Restoration

Add code
Jan 08, 2025
Viaarxiv icon

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Add code
Dec 10, 2024
Figure 1 for StyleMaster: Stylize Your Video with Artistic Generation and Translation
Figure 2 for StyleMaster: Stylize Your Video with Artistic Generation and Translation
Figure 3 for StyleMaster: Stylize Your Video with Artistic Generation and Translation
Figure 4 for StyleMaster: Stylize Your Video with Artistic Generation and Translation
Viaarxiv icon

DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model

Add code
Dec 08, 2024
Figure 1 for DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
Figure 2 for DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
Figure 3 for DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
Figure 4 for DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
Viaarxiv icon

SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model

Add code
Dec 04, 2024
Viaarxiv icon

Foundation Cures Personalization: Recovering Facial Personalized Models' Prompt Consistency

Add code
Nov 22, 2024
Viaarxiv icon

EVA: An Embodied World Model for Future Video Anticipation

Add code
Oct 20, 2024
Figure 1 for EVA: An Embodied World Model for Future Video Anticipation
Figure 2 for EVA: An Embodied World Model for Future Video Anticipation
Figure 3 for EVA: An Embodied World Model for Future Video Anticipation
Figure 4 for EVA: An Embodied World Model for Future Video Anticipation
Viaarxiv icon