Andrew Zisserman

DeepMind

Learning from Streaming Video with Orthogonal Gradients

Apr 02, 2025

Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation

Apr 01, 2025

From Panels to Prose: Generating Literary Narratives from Comics

Mar 30, 2025

Understanding Co-speech Gestures in-the-wild

Mar 28, 2025

ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval

Feb 21, 2025

Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues

Jan 16, 2025

VoiceVector: Multimodal Enrolment Vectors for Speaker Separation

Jan 02, 2025

Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation

Jan 02, 2025

Scaling 4D Representations

Dec 19, 2024

New keypoint-based approach for recognising British Sign Language (BSL) from sequences

Dec 12, 2024