Yonghao Dang

L2HCount: Generalizing Crowd Counting from Low to High Crowd Density via Density Simulation

Mar 17, 2025

GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting

Mar 07, 2025

QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning

Dec 23, 2024

MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection

Dec 02, 2024

Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation

Sep 16, 2024

ActivityCLIP: Enhancing Group Activity Recognition by Mining Complementary Information from Text to Supplement Image Modality

Jul 29, 2024

Micro-expression recognition based on depth map to point cloud

Jun 12, 2024

DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation

Apr 27, 2024

Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection

Jan 10, 2024

Spatial-Temporal Decoupling Contrastive Learning for Skeleton-based Human Action Recognition

Jan 09, 2024