Song Bai

Alibaba Group, University of Oxford

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Jul 23, 2024
Via arXiv

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Jun 24, 2024
Via arXiv

DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data

Jun 07, 2024
Via arXiv

Debiasing Text-to-Image Diffusion Models

Feb 22, 2024
Via arXiv

Progress and Prospects in 3D Generative AI: A Technical Overview including 3D human

Jan 05, 2024
Via arXiv

General Object Foundation Model for Images and Videos at Scale

Dec 14, 2023
Via arXiv

Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery

Dec 05, 2023
Via arXiv

Dataset Condensation via Generative Model

Sep 14, 2023
Via arXiv

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks

Aug 13, 2023
Via arXiv

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding

Aug 01, 2023
Via arXiv