Picture for Philip J. B. Jackson

Philip J. B. Jackson

Deconstruct Complexity (DeComplex): A Novel Perspective on Tackling Dense Action Detection

Add code
Jan 30, 2025
Viaarxiv icon

Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation

Add code
Oct 29, 2024
Figure 1 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 2 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 3 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 4 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Viaarxiv icon

An Effective-Efficient Approach for Dense Multi-Label Action Detection

Add code
Jun 10, 2024
Figure 1 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Figure 2 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Figure 3 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Figure 4 for An Effective-Efficient Approach for Dense Multi-Label Action Detection
Viaarxiv icon

Audio-Visual Talker Localization in Video for Spatial Sound Reproduction

Add code
Jun 01, 2024
Figure 1 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 2 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 3 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 4 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Viaarxiv icon

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Add code
May 17, 2024
Viaarxiv icon

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Add code
Dec 21, 2023
Viaarxiv icon

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Dec 14, 2023
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Oct 23, 2023
Figure 1 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 2 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 3 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Figure 4 for Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions
Viaarxiv icon

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection

Add code
Aug 09, 2023
Viaarxiv icon

Audio Inputs for Active Speaker Detection and Localization via Microphone Array

Add code
Jul 27, 2023
Figure 1 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 2 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 3 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 4 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Viaarxiv icon