Picture for Xuanhan Wang

Xuanhan Wang

Scale-Aware Pre-Training for Human-Centric Visual Perception: Enabling Lightweight and Generalizable Models

Add code
Mar 11, 2025
Viaarxiv icon

Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model

Add code
Nov 29, 2024
Figure 1 for Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model
Figure 2 for Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model
Figure 3 for Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model
Figure 4 for Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model
Viaarxiv icon

Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles

Add code
Sep 10, 2024
Viaarxiv icon

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

Add code
Jul 17, 2024
Viaarxiv icon

X-HRNet: Towards Lightweight Human Pose Estimation with Spatially Unidimensional Self-Attention

Add code
Oct 12, 2023
Viaarxiv icon

CIParsing: Unifying Causality Properties into Multiple Human Parsing

Add code
Aug 23, 2023
Figure 1 for CIParsing: Unifying Causality Properties into Multiple Human Parsing
Figure 2 for CIParsing: Unifying Causality Properties into Multiple Human Parsing
Figure 3 for CIParsing: Unifying Causality Properties into Multiple Human Parsing
Figure 4 for CIParsing: Unifying Causality Properties into Multiple Human Parsing
Viaarxiv icon

RepParser: End-to-End Multiple Human Parsing with Representative Parts

Add code
Aug 27, 2022
Figure 1 for RepParser: End-to-End Multiple Human Parsing with Representative Parts
Figure 2 for RepParser: End-to-End Multiple Human Parsing with Representative Parts
Figure 3 for RepParser: End-to-End Multiple Human Parsing with Representative Parts
Figure 4 for RepParser: End-to-End Multiple Human Parsing with Representative Parts
Viaarxiv icon

Skeleton-based Action Recognition via Adaptive Cross-Form Learning

Add code
Jun 30, 2022
Figure 1 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 2 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 3 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Figure 4 for Skeleton-based Action Recognition via Adaptive Cross-Form Learning
Viaarxiv icon

KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing

Add code
Jun 21, 2022
Figure 1 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 2 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 3 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Figure 4 for KE-RCNN: Unifying Knowledge based Reasoning into Part-level Attribute Parsing
Viaarxiv icon

KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences

Add code
Jun 21, 2022
Figure 1 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 2 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 3 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Figure 4 for KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences
Viaarxiv icon