Pu Wang

UAKNN: Label Distribution Learning via Uncertainty-Aware KNN

Apr 02, 2025
Via arXiv

PolypFlow: Reinforcing Polyp Segmentation with Flow-Driven Dynamics

Feb 26, 2025
Via arXiv

SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living

Feb 05, 2025
Via arXiv

mmCooper: A Multi-agent Multi-stage Communication-efficient and Collaboration-robust Cooperative Perception Framework

Jan 21, 2025
Via arXiv

BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos

Jan 14, 2025
Via arXiv

MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection

Jan 10, 2025
Via arXiv

Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation

Dec 29, 2024
Via arXiv

GenHMR: Generative Human Mesh Recovery

Dec 19, 2024
Via arXiv

MMHMR: Generative Masked Modeling for Hand Mesh Recovery

Dec 18, 2024
Via arXiv

Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation

Nov 26, 2024
Via arXiv