Picture for Jianqin Yin

Jianqin Yin

Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation

Add code
Sep 16, 2024
Viaarxiv icon

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection

Add code
Sep 11, 2024
Viaarxiv icon

SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation

Add code
Aug 09, 2024
Figure 1 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Figure 2 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Figure 3 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Figure 4 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Viaarxiv icon

ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality

Add code
Jul 29, 2024
Viaarxiv icon

Micro-expression recognition based on depth map to point cloud

Add code
Jun 12, 2024
Viaarxiv icon

CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering

Add code
May 13, 2024
Viaarxiv icon

DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation

Add code
Apr 27, 2024
Viaarxiv icon

OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

Add code
Apr 25, 2024
Viaarxiv icon

Towards more realistic human motion prediction with attention to motion coordination

Add code
Apr 04, 2024
Viaarxiv icon

Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection

Add code
Jan 10, 2024
Viaarxiv icon