Liang-Yan Gui

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Oct 10, 2024

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Sep 05, 2024

Floating No More: Object-Ground Reconstruction from a Single Image

Jul 26, 2024

Situational Awareness Matters in 3D Vision Language Reasoning

Jun 11, 2024

SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Apr 18, 2024

InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction

Mar 28, 2024

HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

Feb 05, 2024

Aligning Large Multimodal Models with Factually Augmented RLHF

Sep 25, 2023

InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion

Aug 31, 2023

Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation

Aug 17, 2023