Picture for Andrew Zisserman

Andrew Zisserman

DeepMind

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos

Add code
Nov 13, 2024
Viaarxiv icon

Automated Spinal MRI Labelling from Reports Using a Large Language Model

Add code
Oct 22, 2024
Viaarxiv icon

It's Just Another Day: Unique Video Captioning by Discriminative Prompting

Add code
Oct 15, 2024
Figure 1 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 2 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 3 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Figure 4 for It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Viaarxiv icon

Character-aware audio-visual subtitling in context

Add code
Oct 14, 2024
Viaarxiv icon

The VoxCeleb Speaker Recognition Challenge: A Retrospective

Add code
Aug 27, 2024
Viaarxiv icon

3D-Aware Instance Segmentation and Tracking in Egocentric Videos

Add code
Aug 19, 2024
Viaarxiv icon

Tails Tell Tales: Chapter-Wide Manga Transcriptions with Character Names

Add code
Aug 01, 2024
Viaarxiv icon

OVR: A Dataset for Open Vocabulary Temporal Repetition Counting in Videos

Add code
Jul 24, 2024
Viaarxiv icon

AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description

Add code
Jul 22, 2024
Viaarxiv icon

TAPVid-3D: A Benchmark for Tracking Any Point in 3D

Add code
Jul 08, 2024
Viaarxiv icon