Picture for Philip J. B. Jackson

Philip J. B. Jackson

Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation

Add code
Oct 29, 2024
Figure 1 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 2 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 3 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 4 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Viaarxiv icon

An Effective-Efficient Approach for Dense Multi-Label Action Detection

Add code
Jun 10, 2024
Figure 1 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Figure 2 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Figure 3 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Figure 4 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Viaarxiv icon

Audio-Visual Talker Localization in Video for Spatial Sound Reproduction

Add code
Jun 01, 2024
Figure 1 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 2 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 3 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 4 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Viaarxiv icon

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Add code
May 17, 2024
Viaarxiv icon

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Add code
Dec 21, 2023
Viaarxiv icon

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Dec 14, 2023
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Oct 23, 2023
Viaarxiv icon

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

Add code
Aug 09, 2023
Viaarxiv icon

Audio Inputs for Active Speaker Detection and Localization via Microphone Array

Add code
Jul 27, 2023
Figure 1 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 2 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 3 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 4 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Viaarxiv icon

Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research

Add code
Dec 04, 2022
Viaarxiv icon