Picture for Shuning Chang

Shuning Chang

FlexDiT: Dynamic Token Density Control for Diffusion Transformer

Add code
Dec 08, 2024
Viaarxiv icon

Revisiting Vision Transformer from the View of Path Ensemble

Add code
Aug 12, 2023
Viaarxiv icon

Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Add code
May 29, 2023
Viaarxiv icon

DOAD: Decoupled One Stage Action Detection Network

Add code
Apr 04, 2023
Viaarxiv icon

Making Vision Transformers Efficient from A Token Sparsification View

Add code
Mar 30, 2023
Figure 1 for Making Vision Transformers Efficient from A Token Sparsification View
Figure 2 for Making Vision Transformers Efficient from A Token Sparsification View
Figure 3 for Making Vision Transformers Efficient from A Token Sparsification View
Figure 4 for Making Vision Transformers Efficient from A Token Sparsification View
Viaarxiv icon

KVT: k-NN Attention for Boosting Vision Transformers

Add code
May 28, 2021
Figure 1 for KVT: k-NN Attention for Boosting Vision Transformers
Figure 2 for KVT: k-NN Attention for Boosting Vision Transformers
Figure 3 for KVT: k-NN Attention for Boosting Vision Transformers
Figure 4 for KVT: k-NN Attention for Boosting Vision Transformers
Viaarxiv icon

Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation

Add code
Mar 30, 2021
Figure 1 for Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
Figure 2 for Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
Figure 3 for Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
Figure 4 for Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
Viaarxiv icon

Towards Accurate Human Pose Estimation in Videos of Crowded Scenes

Add code
Oct 21, 2020
Figure 1 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Figure 2 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Figure 3 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Figure 4 for Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
Viaarxiv icon

A Simple Baseline for Pose Tracking in Videos of Crowded Scenes

Add code
Oct 21, 2020
Figure 1 for A Simple Baseline for Pose Tracking in Videos of Crowded Scenes
Figure 2 for A Simple Baseline for Pose Tracking in Videos of Crowded Scenes
Figure 3 for A Simple Baseline for Pose Tracking in Videos of Crowded Scenes
Figure 4 for A Simple Baseline for Pose Tracking in Videos of Crowded Scenes
Viaarxiv icon

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes

Add code
Oct 16, 2020
Figure 1 for Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes
Figure 2 for Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes
Figure 3 for Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes
Figure 4 for Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes
Viaarxiv icon