Picture for David Crandall

David Crandall

TimeRefine: Temporal Grounding with Time Refining Video LLM

Add code
Dec 12, 2024
Viaarxiv icon

Multi-resolution Guided 3D GANs for Medical Image Translation

Add code
Nov 30, 2024
Figure 1 for Multi-resolution Guided 3D GANs for Medical Image Translation
Figure 2 for Multi-resolution Guided 3D GANs for Medical Image Translation
Figure 3 for Multi-resolution Guided 3D GANs for Medical Image Translation
Figure 4 for Multi-resolution Guided 3D GANs for Medical Image Translation
Viaarxiv icon

Case-Enhanced Vision Transformer: Improving Explanations of Image Similarity with a ViT-based Similarity Metric

Add code
Jul 24, 2024
Figure 1 for Case-Enhanced Vision Transformer: Improving Explanations of Image Similarity with a ViT-based Similarity Metric
Figure 2 for Case-Enhanced Vision Transformer: Improving Explanations of Image Similarity with a ViT-based Similarity Metric
Figure 3 for Case-Enhanced Vision Transformer: Improving Explanations of Image Similarity with a ViT-based Similarity Metric
Figure 4 for Case-Enhanced Vision Transformer: Improving Explanations of Image Similarity with a ViT-based Similarity Metric
Viaarxiv icon

Transformer for Object Re-Identification: A Survey

Add code
Jan 13, 2024
Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Add code
Nov 30, 2023
Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Situated Cameras, Situated Knowledges: Towards an Egocentric Epistemology for Computer Vision

Add code
Jun 30, 2023
Viaarxiv icon

A Tensor-based Convolutional Neural Network for Small Dataset Classification

Add code
Mar 29, 2023
Figure 1 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Figure 2 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Figure 3 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Figure 4 for A Tensor-based Convolutional Neural Network for Small Dataset Classification
Viaarxiv icon

SePaint: Semantic Map Inpainting via Multinomial Diffusion

Add code
Mar 05, 2023
Figure 1 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Figure 2 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Figure 3 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Figure 4 for SePaint: Semantic Map Inpainting via Multinomial Diffusion
Viaarxiv icon

LoCoNet: Long-Short Context Network for Active Speaker Detection

Add code
Jan 19, 2023
Figure 1 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Figure 2 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Figure 3 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Figure 4 for LoCoNet: Long-Short Context Network for Active Speaker Detection
Viaarxiv icon

VindLU: A Recipe for Effective Video-and-Language Pretraining

Add code
Dec 09, 2022
Figure 1 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Figure 2 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Figure 3 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Figure 4 for VindLU: A Recipe for Effective Video-and-Language Pretraining
Viaarxiv icon