Yansong Tang

Localization-Aware Multi-Scale Representation Learning for Repetitive Action Counting

Jan 13, 2025

AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation

Dec 09, 2024

Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction

Dec 06, 2024

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Dec 02, 2024

ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models

Nov 30, 2024

UVCG: Leveraging Temporal Consistency for Universal Video Protection

Nov 25, 2024

NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model

Nov 25, 2024

Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation

Nov 24, 2024

Q-VLM: Post-training Quantization for Large Vision-Language Models

Oct 10, 2024

Fully Aligned Network for Referring Image Segmentation

Sep 29, 2024