Yuhao Cheng

Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

Dec 19, 2024

EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation

Dec 06, 2024

Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars

Oct 11, 2024

Revealing Directions for Text-guided 3D Face Editing

Oct 07, 2024

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Aug 23, 2024

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jun 03, 2024

Rethink Predicting the Optical Flow with the Kinetics Perspective

May 21, 2024

TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation

Apr 29, 2024

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

Apr 25, 2024

Monocular Identity-Conditioned Facial Reflectance Reconstruction

Mar 30, 2024