Picture for Guandao Yang

Guandao Yang

FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution

Add code
Apr 09, 2025
Viaarxiv icon

BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing

Add code
Apr 02, 2025
Viaarxiv icon

Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction

Add code
Feb 13, 2025
Viaarxiv icon

FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

Add code
Dec 10, 2024
Figure 1 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Figure 2 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Figure 3 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Figure 4 for FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
Viaarxiv icon

AIpparel: A Large Multimodal Generative Model for Digital Garments

Add code
Dec 05, 2024
Figure 1 for AIpparel: A Large Multimodal Generative Model for Digital Garments
Figure 2 for AIpparel: A Large Multimodal Generative Model for Digital Garments
Figure 3 for AIpparel: A Large Multimodal Generative Model for Digital Garments
Figure 4 for AIpparel: A Large Multimodal Generative Model for Digital Garments
Viaarxiv icon

DiffusionPDE: Generative PDE-Solving Under Partial Observation

Add code
Jun 25, 2024
Figure 1 for DiffusionPDE: Generative PDE-Solving Under Partial Observation
Figure 2 for DiffusionPDE: Generative PDE-Solving Under Partial Observation
Figure 3 for DiffusionPDE: Generative PDE-Solving Under Partial Observation
Figure 4 for DiffusionPDE: Generative PDE-Solving Under Partial Observation
Viaarxiv icon

MegaScenes: Scene-Level View Synthesis at Scale

Add code
Jun 17, 2024
Figure 1 for MegaScenes: Scene-Level View Synthesis at Scale
Figure 2 for MegaScenes: Scene-Level View Synthesis at Scale
Figure 3 for MegaScenes: Scene-Level View Synthesis at Scale
Figure 4 for MegaScenes: Scene-Level View Synthesis at Scale
Viaarxiv icon

InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Shaping

Add code
Jun 09, 2024
Viaarxiv icon

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Add code
Apr 26, 2024
Figure 1 for BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Figure 2 for BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Figure 3 for BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Figure 4 for BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
Viaarxiv icon

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

Add code
Apr 09, 2024
Figure 1 for PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
Figure 2 for PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
Figure 3 for PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
Figure 4 for PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
Viaarxiv icon