Hangjie Yuan

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Dec 12, 2024
Via arXiv

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control

Oct 17, 2024
Via arXiv

EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Oct 10, 2024
Via arXiv

FreeMask: Rethinking the Importance of Attention Masks for Zero-Shot Video Editing

Sep 30, 2024
Via arXiv

PAPM: A Physics-aware Proxy Model for Process Systems

Jul 07, 2024
Via arXiv

Revisiting Neural Networks for Continual Learning: An Architectural Perspective

Apr 28, 2024
Via arXiv

Make Continual Learning Stronger via C-Flat

Apr 01, 2024
Via arXiv

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition

Mar 03, 2024
Via arXiv

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

Dec 25, 2023
Via arXiv

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Dec 19, 2023
Via arXiv