Picture for Shizhe Chen

Shizhe Chen

INRIA

Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy

Add code
Oct 02, 2024
Viaarxiv icon

Conan-embedding: General Text Embedding with More and Better Negative Samples

Add code
Aug 29, 2024
Viaarxiv icon

ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos

Add code
Apr 24, 2024
Viaarxiv icon

Think-Program-reCtify: 3D Situated Reasoning with Large Language Models

Add code
Apr 23, 2024
Viaarxiv icon

SUGAR: Pre-training 3D Visual Representations for Robotics

Add code
Apr 01, 2024
Viaarxiv icon

PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Add code
Sep 27, 2023
Viaarxiv icon

Explore and Tell: Embodied Visual Captioning in 3D Environments

Add code
Aug 21, 2023
Viaarxiv icon

Object Goal Navigation with Recursive Implicit Maps

Add code
Aug 10, 2023
Viaarxiv icon

Robust Visual Sim-to-Real Transfer for Robotic Manipulation

Add code
Jul 28, 2023
Viaarxiv icon

InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation

Add code
May 10, 2023
Viaarxiv icon