Wentao Yuan

Gemini Robotics: Bringing AI into the Physical World

Mar 25, 2025

AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation

Oct 01, 2024

Manipulate-Anything: Automating Real-World Robots using Vision-Language Models

Jun 27, 2024

RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics

Jun 15, 2024

M2T2: Multi-Task Masked Transformer for Object-centric Pick and Place

Nov 02, 2023

Evaluating Robustness of Visual Representations for Object Assembly Task Requiring Spatio-Geometrical Reasoning

Oct 22, 2023

TerrainNet: Visual Modeling of Complex Terrain for High-speed, Off-road Navigation

Mar 28, 2023

KD-MVS: Knowledge Distillation Based Self-supervised Learning for MVS

Jul 21, 2022

Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives

Jul 21, 2022

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers

Nov 29, 2021