Picture for Xuyang Liu

Xuyang Liu

Seeing Sarcasm Through Different Eyes: Analyzing Multimodal Sarcasm Perception in Large Vision-Language Models

Add code
Mar 15, 2025
Viaarxiv icon

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

Add code
Jan 09, 2025
Viaarxiv icon

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Add code
Nov 26, 2024
Viaarxiv icon

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models

Add code
Oct 29, 2024
Viaarxiv icon

Accelerating Diffusion Transformers with Token-wise Feature Caching

Add code
Oct 14, 2024
Viaarxiv icon

M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension

Add code
Jul 01, 2024
Figure 1 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 2 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 3 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 4 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Viaarxiv icon

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

Add code
May 23, 2024
Figure 1 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Figure 2 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Figure 3 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Figure 4 for Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
Viaarxiv icon

Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment

Add code
May 15, 2024
Figure 1 for Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment
Figure 2 for Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment
Figure 3 for Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment
Figure 4 for Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment
Viaarxiv icon

DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding

Add code
May 10, 2024
Viaarxiv icon

VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Add code
Sep 03, 2023
Viaarxiv icon