Picture for Du Tran

Du Tran

Learning Space-Time Semantic Correspondences

Add code
Jun 16, 2023
Viaarxiv icon

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision

Add code
Mar 09, 2023
Viaarxiv icon

MINOTAUR: Multi-task Video Grounding From Multimodal Queries

Add code
Feb 16, 2023
Viaarxiv icon

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity

Add code
Apr 12, 2022
Figure 1 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Figure 2 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Figure 3 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Figure 4 for Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity
Viaarxiv icon

Long-Short Temporal Contrastive Learning of Video Transformers

Add code
Jul 08, 2021
Figure 1 for Long-Short Temporal Contrastive Learning of Video Transformers
Figure 2 for Long-Short Temporal Contrastive Learning of Video Transformers
Figure 3 for Long-Short Temporal Contrastive Learning of Video Transformers
Figure 4 for Long-Short Temporal Contrastive Learning of Video Transformers
Viaarxiv icon

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation

Add code
Apr 10, 2021
Figure 1 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Figure 2 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Figure 3 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Figure 4 for Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Viaarxiv icon

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation

Add code
Dec 15, 2020
Figure 1 for FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Figure 2 for FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Figure 3 for FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Figure 4 for FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Viaarxiv icon

Self-Supervised Learning by Cross-Modal Audio-Video Clustering

Add code
Nov 28, 2019
Figure 1 for Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Figure 2 for Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Figure 3 for Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Figure 4 for Self-Supervised Learning by Cross-Modal Audio-Video Clustering
Viaarxiv icon

UniDual: A Unified Model for Image and Video Understanding

Add code
Jun 12, 2019
Figure 1 for UniDual: A Unified Model for Image and Video Understanding
Figure 2 for UniDual: A Unified Model for Image and Video Understanding
Figure 3 for UniDual: A Unified Model for Image and Video Understanding
Figure 4 for UniDual: A Unified Model for Image and Video Understanding
Viaarxiv icon

FASTER Recurrent Networks for Video Classification

Add code
Jun 10, 2019
Figure 1 for FASTER Recurrent Networks for Video Classification
Figure 2 for FASTER Recurrent Networks for Video Classification
Figure 3 for FASTER Recurrent Networks for Video Classification
Figure 4 for FASTER Recurrent Networks for Video Classification
Viaarxiv icon