Picture for Xuweiyi Chen

Xuweiyi Chen

Learning 3D Representations from Procedural 3D Programs

Add code
Nov 25, 2024
Viaarxiv icon

Open Vocabulary Monocular 3D Object Detection

Add code
Nov 25, 2024
Viaarxiv icon

Probing the Mid-level Vision Capabilities of Self-Supervised Learning

Add code
Nov 25, 2024
Viaarxiv icon

Multi-Object Hallucination in Vision-Language Models

Add code
Jul 08, 2024
Viaarxiv icon

3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination

Add code
Jun 12, 2024
Viaarxiv icon

3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs

Add code
Jun 07, 2024
Viaarxiv icon

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified Attention Control

Add code
Mar 06, 2024
Viaarxiv icon

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

Add code
Sep 21, 2023
Viaarxiv icon