Picture for Kalyan Vasudev Alwala

Kalyan Vasudev Alwala

SAM 2: Segment Anything in Images and Videos

Add code
Aug 01, 2024
Figure 1 for SAM 2: Segment Anything in Images and Videos
Figure 2 for SAM 2: Segment Anything in Images and Videos
Figure 3 for SAM 2: Segment Anything in Images and Videos
Figure 4 for SAM 2: Segment Anything in Images and Videos
Viaarxiv icon

ImageBind: One Embedding Space To Bind Them All

Add code
May 09, 2023
Viaarxiv icon

The effectiveness of MAE pre-pretraining for billion-scale pretraining

Add code
Mar 23, 2023
Figure 1 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 2 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 3 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Figure 4 for The effectiveness of MAE pre-pretraining for billion-scale pretraining
Viaarxiv icon

OmniMAE: Single Model Masked Pretraining on Images and Videos

Add code
Jun 16, 2022
Figure 1 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 2 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 3 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Figure 4 for OmniMAE: Single Model Masked Pretraining on Images and Videos
Viaarxiv icon

Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction

Add code
Apr 07, 2022
Figure 1 for Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Figure 2 for Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Figure 3 for Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Figure 4 for Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
Viaarxiv icon

PyTorchVideo: A Deep Learning Library for Video Understanding

Add code
Nov 18, 2021
Figure 1 for PyTorchVideo: A Deep Learning Library for Video Understanding
Figure 2 for PyTorchVideo: A Deep Learning Library for Video Understanding
Figure 3 for PyTorchVideo: A Deep Learning Library for Video Understanding
Viaarxiv icon

Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

Add code
May 28, 2021
Figure 1 for Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation
Figure 2 for Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation
Figure 3 for Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation
Figure 4 for Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation
Viaarxiv icon

Joint Sampling and Trajectory Optimization over Graphs for Online Motion Planning

Add code
Nov 13, 2020
Figure 1 for Joint Sampling and Trajectory Optimization over Graphs for Online Motion Planning
Figure 2 for Joint Sampling and Trajectory Optimization over Graphs for Online Motion Planning
Figure 3 for Joint Sampling and Trajectory Optimization over Graphs for Online Motion Planning
Figure 4 for Joint Sampling and Trajectory Optimization over Graphs for Online Motion Planning
Viaarxiv icon

PyRobot: An Open-source Robotics Framework for Research and Benchmarking

Add code
Jun 19, 2019
Figure 1 for PyRobot: An Open-source Robotics Framework for Research and Benchmarking
Figure 2 for PyRobot: An Open-source Robotics Framework for Research and Benchmarking
Figure 3 for PyRobot: An Open-source Robotics Framework for Research and Benchmarking
Figure 4 for PyRobot: An Open-source Robotics Framework for Research and Benchmarking
Viaarxiv icon