Shouhong Ding

D2Pruner: Debiased Importance and Structural Diversity for MLLM Token Pruning

Dec 22, 2025

GloTok: Global Perspective Tokenizer for Image Reconstruction and Generation

Nov 19, 2025

TripleFDS: Triple Feature Disentanglement and Synthesis for Scene Text Editing

Nov 17, 2025

Switchable Token-Specific Codebook Quantization For Face Image Compression

Oct 27, 2025

Towards Rationale-Answer Alignment of LVLMs via Self-Rationale Calibration

Sep 17, 2025

VISA: Group-wise Visual Token Selection and Aggregation via Graph Summarization for Efficient MLLMs Inference

Aug 25, 2025

MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic Learning

Jul 16, 2025

AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models

Jul 03, 2025

Guard Me If You Know Me: Protecting Specific Face-Identity from Deepfakes

May 26, 2025

Dual Data Alignment Makes AI-Generated Image Detector Easier Generalizable

May 20, 2025