Picture for Jonathon Shlens

Jonathon Shlens

Learning Visual Composition through Improved Semantic Guidance

Add code
Dec 19, 2024
Figure 1 for Learning Visual Composition through Improved Semantic Guidance
Figure 2 for Learning Visual Composition through Improved Semantic Guidance
Figure 3 for Learning Visual Composition through Improved Semantic Guidance
Figure 4 for Learning Visual Composition through Improved Semantic Guidance
Viaarxiv icon

Towards flexible perception with visual memory

Add code
Aug 15, 2024
Figure 1 for Towards flexible perception with visual memory
Figure 2 for Towards flexible perception with visual memory
Figure 3 for Towards flexible perception with visual memory
Figure 4 for Towards flexible perception with visual memory
Viaarxiv icon

Capabilities of Gemini Models in Medicine

Add code
May 01, 2024
Figure 1 for Capabilities of Gemini Models in Medicine
Figure 2 for Capabilities of Gemini Models in Medicine
Figure 3 for Capabilities of Gemini Models in Medicine
Figure 4 for Capabilities of Gemini Models in Medicine
Viaarxiv icon

On Robustness in Multimodal Learning

Add code
Apr 11, 2023
Viaarxiv icon

STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

Add code
Feb 08, 2023
Viaarxiv icon

PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds

Add code
Oct 24, 2022
Viaarxiv icon

Perceptual Grouping in Vision-Language Models

Add code
Oct 18, 2022
Figure 1 for Perceptual Grouping in Vision-Language Models
Figure 2 for Perceptual Grouping in Vision-Language Models
Figure 3 for Perceptual Grouping in Vision-Language Models
Figure 4 for Perceptual Grouping in Vision-Language Models
Viaarxiv icon

Soft Calibration Objectives for Neural Networks

Add code
Jul 30, 2021
Figure 1 for Soft Calibration Objectives for Neural Networks
Figure 2 for Soft Calibration Objectives for Neural Networks
Figure 3 for Soft Calibration Objectives for Neural Networks
Figure 4 for Soft Calibration Objectives for Neural Networks
Viaarxiv icon

Scene Transformer: A unified multi-task model for behavior prediction and planning

Add code
Jun 15, 2021
Figure 1 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Figure 2 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Figure 3 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Figure 4 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Viaarxiv icon

Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset

Add code
Apr 20, 2021
Figure 1 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Figure 2 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Figure 3 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Figure 4 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Viaarxiv icon