Picture for Jianqin Yin

Jianqin Yin

MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection

Add code
Dec 02, 2024
Figure 1 for MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection
Figure 2 for MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection
Figure 3 for MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection
Figure 4 for MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection
Viaarxiv icon

InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models

Add code
Nov 18, 2024
Figure 1 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Figure 2 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Figure 3 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Figure 4 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Viaarxiv icon

Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation

Add code
Sep 16, 2024
Figure 1 for Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation
Figure 2 for Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation
Figure 3 for Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation
Figure 4 for Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation
Viaarxiv icon

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection

Add code
Sep 11, 2024
Figure 1 for MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
Figure 2 for MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
Figure 3 for MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
Figure 4 for MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection
Viaarxiv icon

SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation

Add code
Aug 09, 2024
Figure 1 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Figure 2 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Figure 3 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Figure 4 for SELD-Mamba: Selective State-Space Model for Sound Event Localization and Detection with Source Distance Estimation
Viaarxiv icon

ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality

Add code
Jul 29, 2024
Viaarxiv icon

Micro-expression recognition based on depth map to point cloud

Add code
Jun 12, 2024
Viaarxiv icon

CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering

Add code
May 13, 2024
Viaarxiv icon

DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation

Add code
Apr 27, 2024
Figure 1 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Figure 2 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Figure 3 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Figure 4 for DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation
Viaarxiv icon

OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

Add code
Apr 25, 2024
Figure 1 for OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation
Figure 2 for OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation
Figure 3 for OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation
Figure 4 for OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation
Viaarxiv icon