Yansong Tang

IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis

Mar 02, 2025
Via arXiv

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Feb 25, 2025
Via arXiv

GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting

Jan 26, 2025
Via arXiv

Localization-Aware Multi-Scale Representation Learning for Repetitive Action Counting

Jan 13, 2025
Via arXiv

AnyBimanual: Transferring Unimanual Policy for General Bimanual Manipulation

Dec 09, 2024
Via arXiv

Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction

Dec 06, 2024
Via arXiv

Ponder & Press: Advancing Visual GUI Agent towards General Computer Control

Dec 02, 2024
Via arXiv

ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models

Nov 30, 2024
Via arXiv

UVCG: Leveraging Temporal Consistency for Universal Video Protection

Nov 25, 2024
Via arXiv

NovelGS: Consistent Novel-view Denoising via Large Gaussian Reconstruction Model

Nov 25, 2024
Via arXiv