Picture for Donghao Zhou

Donghao Zhou

DisCo-Layout: Disentangling and Coordinating Semantic and Physical Refinement in a Multi-Agent Framework for 3D Indoor Layout Synthesis

Add code
Oct 02, 2025
Viaarxiv icon

HERO: Hierarchical Extrapolation and Refresh for Efficient World Models

Add code
Aug 25, 2025
Figure 1 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Figure 2 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Figure 3 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Figure 4 for HERO: Hierarchical Extrapolation and Refresh for Efficient World Models
Viaarxiv icon

Trade-offs in Image Generation: How Do Different Dimensions Interact?

Add code
Jul 29, 2025
Viaarxiv icon

CellVerse: Do Large Language Models Really Understand Cell Biology?

Add code
May 09, 2025
Figure 1 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Figure 2 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Figure 3 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Figure 4 for CellVerse: Do Large Language Models Really Understand Cell Biology?
Viaarxiv icon

An Empirical Study of GPT-4o Image Generation Capabilities

Add code
Apr 08, 2025
Viaarxiv icon

Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing

Add code
Dec 15, 2024
Figure 1 for Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
Figure 2 for Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
Figure 3 for Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
Figure 4 for Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
Viaarxiv icon

Point Cloud Understanding via Attention-Driven Contrastive Learning

Add code
Nov 22, 2024
Figure 1 for Point Cloud Understanding via Attention-Driven Contrastive Learning
Figure 2 for Point Cloud Understanding via Attention-Driven Contrastive Learning
Figure 3 for Point Cloud Understanding via Attention-Driven Contrastive Learning
Figure 4 for Point Cloud Understanding via Attention-Driven Contrastive Learning
Viaarxiv icon

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

Add code
Oct 17, 2024
Viaarxiv icon

TripletMix: Triplet Data Augmentation for 3D Understanding

Add code
May 28, 2024
Viaarxiv icon

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning

Add code
Jan 22, 2024
Viaarxiv icon