Picture for Carl Vondrick

Carl Vondrick

MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting

Add code
Dec 04, 2024
Figure 1 for MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
Figure 2 for MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
Figure 3 for MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
Figure 4 for MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
Viaarxiv icon

Self-Improving Autonomous Underwater Manipulation

Add code
Oct 24, 2024
Figure 1 for Self-Improving Autonomous Underwater Manipulation
Figure 2 for Self-Improving Autonomous Underwater Manipulation
Figure 3 for Self-Improving Autonomous Underwater Manipulation
Figure 4 for Self-Improving Autonomous Underwater Manipulation
Viaarxiv icon

Differentiable Robot Rendering

Add code
Oct 17, 2024
Figure 1 for Differentiable Robot Rendering
Figure 2 for Differentiable Robot Rendering
Figure 3 for Differentiable Robot Rendering
Figure 4 for Differentiable Robot Rendering
Viaarxiv icon

EraseDraw: Learning to Insert Objects by Erasing Them from Images

Add code
Aug 31, 2024
Figure 1 for EraseDraw: Learning to Insert Objects by Erasing Them from Images
Figure 2 for EraseDraw: Learning to Insert Objects by Erasing Them from Images
Figure 3 for EraseDraw: Learning to Insert Objects by Erasing Them from Images
Figure 4 for EraseDraw: Learning to Insert Objects by Erasing Them from Images
Viaarxiv icon

Controlling the World by Sleight of Hand

Add code
Aug 13, 2024
Figure 1 for Controlling the World by Sleight of Hand
Figure 2 for Controlling the World by Sleight of Hand
Figure 3 for Controlling the World by Sleight of Hand
Figure 4 for Controlling the World by Sleight of Hand
Viaarxiv icon

Dreamitate: Real-World Visuomotor Policy Learning via Video Generation

Add code
Jun 24, 2024
Figure 1 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Figure 2 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Figure 3 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Figure 4 for Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Viaarxiv icon

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Add code
Jun 20, 2024
Figure 1 for Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Figure 2 for Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Figure 3 for Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Figure 4 for Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Viaarxiv icon

See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding

Add code
Jun 17, 2024
Figure 1 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Figure 2 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Figure 3 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Figure 4 for See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
Viaarxiv icon

How Video Meetings Change Your Expression

Add code
Jun 03, 2024
Viaarxiv icon

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

Add code
May 23, 2024
Viaarxiv icon