Picture for David F. Fouhey

David F. Fouhey

Multi-Object Hallucination in Vision-Language Models

Add code
Jul 08, 2024
Figure 1 for Multi-Object Hallucination in Vision-Language Models
Figure 2 for Multi-Object Hallucination in Vision-Language Models
Figure 3 for Multi-Object Hallucination in Vision-Language Models
Figure 4 for Multi-Object Hallucination in Vision-Language Models
Viaarxiv icon

3D-MVP: 3D Multiview Pretraining for Robotic Manipulation

Add code
Jun 26, 2024
Figure 1 for 3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
Figure 2 for 3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
Figure 3 for 3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
Figure 4 for 3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
Viaarxiv icon

3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination

Add code
Jun 12, 2024
Figure 1 for 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Figure 2 for 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Figure 3 for 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Figure 4 for 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
Viaarxiv icon

3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs

Add code
Jun 07, 2024
Figure 1 for 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
Figure 2 for 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
Figure 3 for 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
Figure 4 for 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs
Viaarxiv icon

FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation

Add code
Mar 05, 2024
Viaarxiv icon

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

Add code
Sep 21, 2023
Viaarxiv icon

Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data

Add code
Jun 14, 2023
Figure 1 for Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data
Figure 2 for Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data
Figure 3 for Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data
Figure 4 for Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data
Viaarxiv icon

Understanding 3D Object Interaction from a Single Image

Add code
May 16, 2023
Figure 1 for Understanding 3D Object Interaction from a Single Image
Figure 2 for Understanding 3D Object Interaction from a Single Image
Figure 3 for Understanding 3D Object Interaction from a Single Image
Figure 4 for Understanding 3D Object Interaction from a Single Image
Viaarxiv icon

Perspective Fields for Single Image Camera Calibration

Add code
Dec 06, 2022
Figure 1 for Perspective Fields for Single Image Camera Calibration
Figure 2 for Perspective Fields for Single Image Camera Calibration
Figure 3 for Perspective Fields for Single Image Camera Calibration
Figure 4 for Perspective Fields for Single Image Camera Calibration
Viaarxiv icon

Large-Scale Spatial Cross-Calibration of Hinode/SOT-SP and SDO/HMI

Add code
Sep 29, 2022
Figure 1 for Large-Scale Spatial Cross-Calibration of Hinode/SOT-SP and SDO/HMI
Figure 2 for Large-Scale Spatial Cross-Calibration of Hinode/SOT-SP and SDO/HMI
Figure 3 for Large-Scale Spatial Cross-Calibration of Hinode/SOT-SP and SDO/HMI
Figure 4 for Large-Scale Spatial Cross-Calibration of Hinode/SOT-SP and SDO/HMI
Viaarxiv icon