Picture for Jacob Donley

Jacob Donley

Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment

Add code
Jan 30, 2025
Figure 1 for Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment
Figure 2 for Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment
Figure 3 for Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment
Figure 4 for Efficient Audiovisual Speech Processing via MUTUD: Multimodal Training and Unimodal Deployment
Viaarxiv icon

Insights into the Incorporation of Signal Information in Binaural Signal Matching with Wearable Microphone Arrays

Add code
Sep 18, 2024
Viaarxiv icon

M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses

Add code
Sep 17, 2024
Figure 1 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 2 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 3 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 4 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Viaarxiv icon

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

Add code
Aug 09, 2024
Figure 1 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Figure 2 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Figure 3 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Figure 4 for Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
Viaarxiv icon

Design and Analysis of Binaural Signal Matching with Arbitrary Microphone Arrays

Add code
Aug 07, 2024
Viaarxiv icon

Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction

Add code
Feb 27, 2024
Figure 1 for Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
Figure 2 for Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
Figure 3 for Ambisonics Encoding For Arbitrary Microphone Arrays Incorporating Residual Channels For Binaural Reproduction
Viaarxiv icon

On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement

Add code
Jan 15, 2024
Figure 1 for On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement
Figure 2 for On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement
Figure 3 for On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement
Figure 4 for On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement
Viaarxiv icon

Subspace Hybrid MVDR Beamforming for Augmented Hearing

Add code
Nov 30, 2023
Viaarxiv icon

Performance Analysis Of Binaural Signal Matching (BSM) in the Time-Frequency Domain

Add code
Nov 23, 2023
Viaarxiv icon

Subspace Hybrid Beamforming for Head-worn Microphone Arrays

Add code
Mar 15, 2023
Figure 1 for Subspace Hybrid Beamforming for Head-worn Microphone Arrays
Figure 2 for Subspace Hybrid Beamforming for Head-worn Microphone Arrays
Figure 3 for Subspace Hybrid Beamforming for Head-worn Microphone Arrays
Figure 4 for Subspace Hybrid Beamforming for Head-worn Microphone Arrays
Viaarxiv icon