Rongrong Ji

Xiamen University, Peng Cheng Laboratory

Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs

Jun 07, 2026
Via arXiv

ForensicConcept: Transferable Forensic Concepts for AIGI Detection

Jun 05, 2026
Via arXiv

Look on Demand: A Cognitive Scheduling Framework for Visual Evidence Acquisition in Multimodal Reasoning

May 27, 2026
Via arXiv

HASTE: Training-Free Video Diffusion Acceleration via Head-Wise Adaptive Sparse Attention

May 14, 2026
Via arXiv

ALGOGEN: Tool-Generated Verifiable Traces for Reliable Algorithm Visualization

May 12, 2026
Via arXiv

Motion-Aware Caching for Efficient Autoregressive Video Generation

May 03, 2026
Via arXiv

Prototype-Based Test-Time Adaptation of Vision-Language Models

Apr 23, 2026
Via arXiv

ID-Selection: Importance-Diversity Based Visual Token Selection for Efficient LVLM Inference

Apr 07, 2026
Via arXiv

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Mar 31, 2026
Via arXiv

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

Mar 24, 2026
Via arXiv