Kwan-Yee K. Wong

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Oct 25, 2024
Via arXiv

BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Oct 18, 2024
Via arXiv

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Oct 09, 2024
Via arXiv

ArtiFade: Learning to Generate High-quality Subject from Blemished Images

Sep 05, 2024
Via arXiv

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Jul 09, 2024
Via arXiv

Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation

Jul 08, 2024
Via arXiv

A Survey on 3D Human Avatar Modeling -- From Reconstruction to Generation

Jun 06, 2024
Via arXiv

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Mar 12, 2024
Via arXiv

PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis

Mar 04, 2024
Via arXiv

MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation

Jan 14, 2024
Via arXiv