Picture for Vijay Vasudevan

Vijay Vasudevan

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Viaarxiv icon

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Add code
Jun 22, 2022
Figure 1 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 2 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 3 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 4 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Viaarxiv icon

When does dough become a bagel? Analyzing the remaining mistakes on ImageNet

Add code
May 09, 2022
Figure 1 for When does dough become a bagel? Analyzing the remaining mistakes on ImageNet
Figure 2 for When does dough become a bagel? Analyzing the remaining mistakes on ImageNet
Figure 3 for When does dough become a bagel? Analyzing the remaining mistakes on ImageNet
Figure 4 for When does dough become a bagel? Analyzing the remaining mistakes on ImageNet
Viaarxiv icon

CoCa: Contrastive Captioners are Image-Text Foundation Models

Add code
May 04, 2022
Figure 1 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 2 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 3 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 4 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Viaarxiv icon

To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels

Add code
Jun 25, 2021
Figure 1 for To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Figure 2 for To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Figure 3 for To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Figure 4 for To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
Viaarxiv icon

Scene Transformer: A unified multi-task model for behavior prediction and planning

Add code
Jun 15, 2021
Figure 1 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Figure 2 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Figure 3 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Figure 4 for Scene Transformer: A unified multi-task model for behavior prediction and planning
Viaarxiv icon

Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset

Add code
Apr 20, 2021
Figure 1 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Figure 2 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Figure 3 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Figure 4 for Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Viaarxiv icon

Pseudo-labeling for Scalable 3D Object Detection

Add code
Mar 02, 2021
Figure 1 for Pseudo-labeling for Scalable 3D Object Detection
Figure 2 for Pseudo-labeling for Scalable 3D Object Detection
Figure 3 for Pseudo-labeling for Scalable 3D Object Detection
Figure 4 for Pseudo-labeling for Scalable 3D Object Detection
Viaarxiv icon

Streaming Object Detection for 3-D Point Clouds

Add code
May 04, 2020
Figure 1 for Streaming Object Detection for 3-D Point Clouds
Figure 2 for Streaming Object Detection for 3-D Point Clouds
Figure 3 for Streaming Object Detection for 3-D Point Clouds
Figure 4 for Streaming Object Detection for 3-D Point Clouds
Viaarxiv icon