Jiayi Ji

HieraVid: Hierarchical Token Pruning for Fast Video Large Language Models

Apr 02, 2026
Via arXiv

Persistent Story World Simulation with Continuous Character Customization

Mar 17, 2026
Via arXiv

FIND: A Simple yet Effective Baseline for Diffusion-Generated Image Detection

Mar 15, 2026
Via arXiv

3D-DRES: Detailed 3D Referring Expression Segmentation

Mar 03, 2026
Via arXiv

Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding

Feb 28, 2026
Via arXiv

MICON-Bench: Benchmarking and Enhancing Multi-Image Context Image Generation in Unified Multimodal Models

Feb 23, 2026
Via arXiv

Test-Time Computing for Referring Multimodal Large Language Models

Feb 23, 2026
Via arXiv

SafeNeuron: Neuron-Level Safety Alignment for Large Language Models

Feb 12, 2026
Via arXiv

MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation

Jan 13, 2026
Via arXiv

CSMCIR: CoT-Enhanced Symmetric Alignment with Memory Bank for Composed Image Retrieval

Jan 07, 2026
Via arXiv