Picture for David Crandall

David Crandall

Transformer for Object Re-Identification: A Survey

Add code
Jan 13, 2024
Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Add code
Nov 30, 2023
Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Situated Cameras, Situated Knowledges: Towards an Egocentric Epistemology for Computer Vision

Add code
Jun 30, 2023
Figure 1 for Situated Cameras, Situated Knowledges: Towards an Egocentric Epistemology for Computer Vision
Viaarxiv icon

A Tensor-based Convolutional Neural Network for Small Dataset Classification

Add code
Mar 29, 2023
Figure 1 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Figure 2 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Figure 3 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Figure 4 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Viaarxiv icon

SePaint: Semantic Map Inpainting via Multinomial Diffusion

Add code
Mar 05, 2023
Figure 1 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Figure 2 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Figure 3 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Figure 4 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Viaarxiv icon

LoCoNet: Long-Short Context Network for Active Speaker Detection

Add code
Jan 19, 2023
Figure 1 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Figure 2 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Figure 3 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Figure 4 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Viaarxiv icon

VindLU: A Recipe for Effective Video-and-Language Pretraining

Add code
Dec 09, 2022
Figure 1 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Figure 2 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Figure 3 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Figure 4 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Viaarxiv icon

Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper

Add code
Sep 22, 2022
Figure 1 for Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper
Figure 2 for Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper
Figure 3 for Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper
Figure 4 for Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper
Viaarxiv icon

Action Recognition based on Cross-Situational Action-object Statistics

Add code
Aug 15, 2022
Figure 1 for Action Recognition based on Cross-Situational Action-object Statistics
Figure 2 for Action Recognition based on Cross-Situational Action-object Statistics
Figure 3 for Action Recognition based on Cross-Situational Action-object Statistics
Figure 4 for Action Recognition based on Cross-Situational Action-object Statistics
Viaarxiv icon

Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds

Add code
Jul 26, 2022
Figure 1 for Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds
Figure 2 for Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds
Figure 3 for Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds
Figure 4 for Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds
Viaarxiv icon