Andrew Zisserman

DeepMind

Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues

Jan 16, 2025

VoiceVector: Multimodal Enrolment Vectors for Speaker Separation

Jan 02, 2025

Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation

Jan 02, 2025

Scaling 4D Representations

Dec 19, 2024

New keypoint-based approach for recognising British Sign Language (BSL) from sequences

Dec 12, 2024

3D Spine Shape Estimation from Single 2D DXA

Dec 02, 2024

Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark

Nov 29, 2024

The Sound of Water: Inferring Physical Properties from Pouring Liquids

Nov 18, 2024

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos

Nov 13, 2024

Automated Spinal MRI Labelling from Reports Using a Large Language Model

Oct 22, 2024