Picture for Yuzhou Huang

Yuzhou Huang

Advancing Medical Radiograph Representation Learning: A Hybrid Pre-training Paradigm with Multilevel Semantic Granularity

Add code
Oct 01, 2024
Viaarxiv icon

Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models

Add code
Aug 21, 2024
Viaarxiv icon

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Add code
Mar 19, 2024
Viaarxiv icon

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Add code
Dec 11, 2023
Figure 1 for SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Figure 2 for SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Figure 3 for SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Figure 4 for SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models
Viaarxiv icon