Picture for Yansong Tang

Yansong Tang

AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation

Add code
Dec 09, 2024
Viaarxiv icon

Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction

Add code
Dec 06, 2024
Viaarxiv icon

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Add code
Dec 02, 2024
Viaarxiv icon

ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models

Add code
Nov 30, 2024
Viaarxiv icon

UVCG: Leveraging Temporal Consistency for Universal Video Protection

Add code
Nov 25, 2024
Viaarxiv icon

NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model

Add code
Nov 25, 2024
Figure 1 for NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
Figure 2 for NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
Figure 3 for NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
Figure 4 for NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model
Viaarxiv icon

Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation

Add code
Nov 24, 2024
Viaarxiv icon

Q-VLM: Post-training Quantization for Large Vision-Language Models

Add code
Oct 10, 2024
Figure 1 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Figure 2 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Figure 3 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Figure 4 for Q-VLM: Post-training Quantization for Large Vision-Language Models
Viaarxiv icon

Fully Aligned Network for Referring Image Segmentation

Add code
Sep 29, 2024
Viaarxiv icon

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Add code
Aug 01, 2024
Viaarxiv icon