Picture for Siteng Huang

Siteng Huang

Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation

Add code
Dec 13, 2024
Viaarxiv icon

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Add code
Dec 09, 2024
Viaarxiv icon

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Add code
Nov 26, 2024
Viaarxiv icon

Accelerating Diffusion Transformers with Token-wise Feature Caching

Add code
Oct 14, 2024
Viaarxiv icon

ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification

Add code
Sep 30, 2024
Figure 1 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Figure 2 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Figure 3 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Figure 4 for ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification
Viaarxiv icon

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Add code
Sep 11, 2024
Viaarxiv icon

Focus-Consistent Multi-Level Aggregation for Compositional Zero-Shot Learning

Add code
Aug 30, 2024
Viaarxiv icon

M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension

Add code
Jul 01, 2024
Figure 1 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 2 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 3 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 4 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Viaarxiv icon

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

Add code
May 23, 2024
Viaarxiv icon

DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding

Add code
May 10, 2024
Viaarxiv icon