Picture for Naomi Harte

Naomi Harte

Uncovering the Visual Contribution in Audio-Visual Speech Recognition

Add code
Dec 22, 2024
Viaarxiv icon

Noise-Robust Hearing Aid Voice Control

Add code
Nov 05, 2024
Viaarxiv icon

Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation

Add code
Feb 20, 2023
Figure 1 for Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation
Figure 2 for Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation
Figure 3 for Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation
Figure 4 for Learnable Frontends that do not Learn: Quantifying Sensitivity to Filterbank Initialisation
Viaarxiv icon

Learnable Acoustic Frontends in Bird Activity Detection

Add code
Oct 03, 2022
Figure 1 for Learnable Acoustic Frontends in Bird Activity Detection
Figure 2 for Learnable Acoustic Frontends in Bird Activity Detection
Figure 3 for Learnable Acoustic Frontends in Bird Activity Detection
Figure 4 for Learnable Acoustic Frontends in Bird Activity Detection
Viaarxiv icon

Low Resource Species Agnostic Bird Activity Detection

Add code
Dec 16, 2021
Figure 1 for Low Resource Species Agnostic Bird Activity Detection
Figure 2 for Low Resource Species Agnostic Bird Activity Detection
Figure 3 for Low Resource Species Agnostic Bird Activity Detection
Figure 4 for Low Resource Species Agnostic Bird Activity Detection
Viaarxiv icon

Bioacoustic Event Detection with prototypical networks and data augmentation

Add code
Dec 16, 2021
Figure 1 for Bioacoustic Event Detection with prototypical networks and data augmentation
Figure 2 for Bioacoustic Event Detection with prototypical networks and data augmentation
Figure 3 for Bioacoustic Event Detection with prototypical networks and data augmentation
Figure 4 for Bioacoustic Event Detection with prototypical networks and data augmentation
Viaarxiv icon

AV Taris: Online Audio-Visual Speech Recognition

Add code
Dec 14, 2020
Figure 1 for AV Taris: Online Audio-Visual Speech Recognition
Figure 2 for AV Taris: Online Audio-Visual Speech Recognition
Figure 3 for AV Taris: Online Audio-Visual Speech Recognition
Figure 4 for AV Taris: Online Audio-Visual Speech Recognition
Viaarxiv icon

Deep Multi-Scale Feature Learning for Defocus Blur Estimation

Add code
Sep 24, 2020
Figure 1 for Deep Multi-Scale Feature Learning for Defocus Blur Estimation
Figure 2 for Deep Multi-Scale Feature Learning for Defocus Blur Estimation
Figure 3 for Deep Multi-Scale Feature Learning for Defocus Blur Estimation
Figure 4 for Deep Multi-Scale Feature Learning for Defocus Blur Estimation
Viaarxiv icon

Learning to Count Words in Fluent Speech enables Online Speech Recognition

Add code
Jun 11, 2020
Figure 1 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Figure 2 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Figure 3 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Figure 4 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Viaarxiv icon

Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition

Add code
May 19, 2020
Figure 1 for Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
Figure 2 for Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
Figure 3 for Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
Figure 4 for Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
Viaarxiv icon