Picture for Yong-Lu Li

Yong-Lu Li

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions

Add code
Dec 27, 2024
Viaarxiv icon

M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation

Add code
Dec 19, 2024
Figure 1 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Figure 2 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Figure 3 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Figure 4 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Viaarxiv icon

Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis

Add code
Dec 12, 2024
Figure 1 for Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
Figure 2 for Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
Figure 3 for Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
Figure 4 for Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
Viaarxiv icon

Homogeneous Dynamics Space for Heterogeneous Humans

Add code
Dec 09, 2024
Viaarxiv icon

Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models

Add code
Dec 06, 2024
Viaarxiv icon

Motion Before Action: Diffusing Object Motion as Manipulation Condition

Add code
Nov 14, 2024
Viaarxiv icon

ImDy: Human Inverse Dynamics from Imitated Observations

Add code
Oct 23, 2024
Figure 1 for ImDy: Human Inverse Dynamics from Imitated Observations
Figure 2 for ImDy: Human Inverse Dynamics from Imitated Observations
Figure 3 for ImDy: Human Inverse Dynamics from Imitated Observations
Figure 4 for ImDy: Human Inverse Dynamics from Imitated Observations
Viaarxiv icon

The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs

Add code
Oct 02, 2024
Viaarxiv icon

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Add code
Jul 29, 2024
Viaarxiv icon

DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control

Add code
Jul 20, 2024
Figure 1 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Figure 2 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Figure 3 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Figure 4 for DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control
Viaarxiv icon