Picture for Mingxu Zhang

Mingxu Zhang

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping

Add code
May 30, 2025
Viaarxiv icon

CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation

Add code
May 04, 2025
Viaarxiv icon

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

Add code
Dec 24, 2023
Figure 1 for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
Figure 2 for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
Figure 3 for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
Figure 4 for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
Viaarxiv icon