Picture for Yin Cui

Yin Cui

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Add code
Nov 11, 2024
Viaarxiv icon

Edify 3D: Scalable High-Quality 3D Asset Generation

Add code
Nov 11, 2024
Figure 1 for Edify 3D: Scalable High-Quality 3D Asset Generation
Figure 2 for Edify 3D: Scalable High-Quality 3D Asset Generation
Figure 3 for Edify 3D: Scalable High-Quality 3D Asset Generation
Figure 4 for Edify 3D: Scalable High-Quality 3D Asset Generation
Viaarxiv icon

Why Fine-grained Labels in Pretraining Benefit Generalization?

Add code
Oct 30, 2024
Viaarxiv icon

Wolf: Captioning Everything with a World Summarization Framework

Add code
Jul 26, 2024
Figure 1 for Wolf: Captioning Everything with a World Summarization Framework
Figure 2 for Wolf: Captioning Everything with a World Summarization Framework
Figure 3 for Wolf: Captioning Everything with a World Summarization Framework
Figure 4 for Wolf: Captioning Everything with a World Summarization Framework
Viaarxiv icon

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

Add code
Apr 30, 2024
Viaarxiv icon

Module-wise Adaptive Distillation for Multimodality Foundation Models

Add code
Oct 06, 2023
Viaarxiv icon

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Add code
Jul 06, 2023
Viaarxiv icon

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Add code
Jun 02, 2023
Figure 1 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 2 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 3 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 4 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Viaarxiv icon

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception

Add code
May 10, 2023
Viaarxiv icon

Towards Understanding the Effect of Pretraining Label Granularity

Add code
Mar 29, 2023
Viaarxiv icon