Picture for Zhuoyang Zhang

Zhuoyang Zhang

NVILA: Efficient Frontier Visual Language Models

Add code
Dec 05, 2024
Figure 1 for NVILA: Efficient Frontier Visual Language Models
Figure 2 for NVILA: Efficient Frontier Visual Language Models
Figure 3 for NVILA: Efficient Frontier Visual Language Models
Figure 4 for NVILA: Efficient Frontier Visual Language Models
Viaarxiv icon

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Add code
Oct 14, 2024
Figure 1 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 2 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 3 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Figure 4 for HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Viaarxiv icon

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Add code
Sep 06, 2024
Figure 1 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 2 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 3 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 4 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Viaarxiv icon

Sparse Refinement for Efficient High-Resolution Semantic Segmentation

Add code
Jul 26, 2024
Viaarxiv icon

Condition-Aware Neural Network for Controlled Image Generation

Add code
Apr 01, 2024
Figure 1 for Condition-Aware Neural Network for Controlled Image Generation
Figure 2 for Condition-Aware Neural Network for Controlled Image Generation
Figure 3 for Condition-Aware Neural Network for Controlled Image Generation
Figure 4 for Condition-Aware Neural Network for Controlled Image Generation
Viaarxiv icon

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Add code
Feb 07, 2024
Viaarxiv icon

One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion

Add code
Nov 14, 2023
Figure 1 for One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Figure 2 for One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Figure 3 for One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Figure 4 for One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Viaarxiv icon

Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model

Add code
Oct 23, 2023
Figure 1 for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Figure 2 for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Figure 3 for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Figure 4 for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Viaarxiv icon

NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding

Add code
Oct 12, 2023
Viaarxiv icon

Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning

Add code
Dec 20, 2022
Viaarxiv icon