Wang Zeng

KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension

Nov 04, 2024

TCFormer: Visual Recognition via Token Clustering Transformer

Jul 16, 2024

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Jul 14, 2024

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks

Feb 23, 2024

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

Aug 28, 2023

Pose for Everything: Towards Category-Agnostic Pose Estimation

Jul 21, 2022

Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer

Apr 21, 2022

3D Human Mesh Regression with Dense Correspondence

Jun 10, 2020