Picture for Sibei Yang

Sibei Yang

SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model

Add code
Dec 02, 2024
Figure 1 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Figure 2 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Figure 3 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Figure 4 for SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Viaarxiv icon

Plain-Det: A Plain Multi-Dataset Object Detector

Add code
Jul 14, 2024
Viaarxiv icon

Part2Object: Hierarchical Unsupervised 3D Instance Segmentation

Add code
Jul 14, 2024
Viaarxiv icon

Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation

Add code
Apr 18, 2024
Viaarxiv icon

The devil is in the object boundary: towards annotation-free instance segmentation using Foundation Models

Add code
Apr 18, 2024
Viaarxiv icon

RealDex: Towards Human-like Grasping for Robotic Dexterous Hand

Add code
Feb 21, 2024
Figure 1 for RealDex: Towards Human-like Grasping for Robotic Dexterous Hand
Figure 2 for RealDex: Towards Human-like Grasping for Robotic Dexterous Hand
Figure 3 for RealDex: Towards Human-like Grasping for Robotic Dexterous Hand
Figure 4 for RealDex: Towards Human-like Grasping for Robotic Dexterous Hand
Viaarxiv icon

OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers

Add code
Dec 18, 2023
Viaarxiv icon

TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition

Add code
Oct 30, 2023
Viaarxiv icon

DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models

Add code
Oct 26, 2023
Viaarxiv icon

Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator

Add code
Sep 25, 2023
Viaarxiv icon