Chao-Yuan Wu

SAM 2: Segment Anything in Images and Videos

Aug 01, 2024

PointInfinity: Resolution-Invariant Point Diffusion Models

Apr 04, 2024

Reversible Vision Transformers

Feb 09, 2023

Multiview Compressive Coding for 3D Reconstruction

Jan 19, 2023

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

Jan 20, 2022

A ConvNet for the 2020s

Jan 10, 2022

Masked Feature Prediction for Self-Supervised Visual Pre-Training

Dec 16, 2021

Improved Multiscale Vision Transformers for Classification and Detection

Dec 02, 2021

Towards Long-Form Video Understanding

Jun 21, 2021

Memory Optimization for Deep Networks

Oct 29, 2020