Picture for Wenhan Luo

Wenhan Luo

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Add code
Dec 10, 2024
Viaarxiv icon

DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model

Add code
Dec 08, 2024
Viaarxiv icon

SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model

Add code
Dec 04, 2024
Viaarxiv icon

Foundation Cures Personalization: Recovering Facial Personalized Models' Prompt Consistency

Add code
Nov 22, 2024
Viaarxiv icon

EVA: An Embodied World Model for Future Video Anticipation

Add code
Oct 20, 2024
Figure 1 for EVA: An Embodied World Model for Future Video Anticipation
Figure 2 for EVA: An Embodied World Model for Future Video Anticipation
Figure 3 for EVA: An Embodied World Model for Future Video Anticipation
Figure 4 for EVA: An Embodied World Model for Future Video Anticipation
Viaarxiv icon

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Add code
Sep 16, 2024
Viaarxiv icon

HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Add code
Sep 04, 2024
Viaarxiv icon

NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models

Add code
Aug 18, 2024
Figure 1 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Figure 2 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Figure 3 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Figure 4 for NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Viaarxiv icon

STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs

Add code
Aug 03, 2024
Viaarxiv icon

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Add code
Jul 30, 2024
Figure 1 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 2 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 3 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Figure 4 for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
Viaarxiv icon