Picture for Andrew Zisserman

Andrew Zisserman

DeepMind

Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues

Add code
Jan 16, 2025
Figure 1 for Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
Figure 2 for Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
Figure 3 for Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
Figure 4 for Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
Viaarxiv icon

VoiceVector: Multimodal Enrolment Vectors for Speaker Separation

Add code
Jan 02, 2025
Figure 1 for VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
Figure 2 for VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
Figure 3 for VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
Figure 4 for VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
Viaarxiv icon

Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation

Add code
Jan 02, 2025
Viaarxiv icon

Scaling 4D Representations

Add code
Dec 19, 2024
Figure 1 for Scaling 4D Representations
Figure 2 for Scaling 4D Representations
Figure 3 for Scaling 4D Representations
Figure 4 for Scaling 4D Representations
Viaarxiv icon

New keypoint-based approach for recognising British Sign Language (BSL) from sequences

Add code
Dec 12, 2024
Figure 1 for New keypoint-based approach for recognising British Sign Language (BSL) from sequences
Figure 2 for New keypoint-based approach for recognising British Sign Language (BSL) from sequences
Figure 3 for New keypoint-based approach for recognising British Sign Language (BSL) from sequences
Figure 4 for New keypoint-based approach for recognising British Sign Language (BSL) from sequences
Viaarxiv icon

3D Spine Shape Estimation from Single 2D DXA

Add code
Dec 02, 2024
Viaarxiv icon

Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark

Add code
Nov 29, 2024
Viaarxiv icon

The Sound of Water: Inferring Physical Properties from Pouring Liquids

Add code
Nov 18, 2024
Viaarxiv icon

A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos

Add code
Nov 13, 2024
Figure 1 for A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
Figure 2 for A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
Figure 3 for A Short Note on Evaluating RepNet for Temporal Repetition Counting in Videos
Viaarxiv icon

Automated Spinal MRI Labelling from Reports Using a Large Language Model

Add code
Oct 22, 2024
Viaarxiv icon