Picture for Xu Tang

Xu Tang

CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression

Add code
Feb 05, 2026
Viaarxiv icon

IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning

Add code
Feb 03, 2026
Viaarxiv icon

Federated Multi-Task Clustering

Add code
Dec 30, 2025
Viaarxiv icon

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services

Add code
Nov 10, 2025
Viaarxiv icon

ASM-UNet: Adaptive Scan Mamba Integrating Group Commonalities and Individual Variations for Fine-Grained Segmentation

Add code
Aug 10, 2025
Viaarxiv icon

ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model

Add code
Apr 16, 2025
Figure 1 for ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model
Figure 2 for ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model
Figure 3 for ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model
Figure 4 for ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model
Viaarxiv icon

LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning

Add code
Apr 03, 2025
Viaarxiv icon

MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery

Add code
Mar 14, 2025
Viaarxiv icon

FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration

Add code
Jan 24, 2025
Figure 1 for FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
Figure 2 for FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
Figure 3 for FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
Figure 4 for FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration
Viaarxiv icon

DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors

Add code
Jan 15, 2025
Figure 1 for DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors
Figure 2 for DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors
Figure 3 for DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors
Figure 4 for DynamicFace: High-Quality and Consistent Video Face Swapping using Composable 3D Facial Priors
Viaarxiv icon