Jiayi Ji

3D-DRES: Detailed 3D Referring Expression Segmentation

Mar 03, 2026

Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding

Feb 28, 2026

MICON-Bench: Benchmarking and Enhancing Multi-Image Context Image Generation in Unified Multimodal Models

Feb 23, 2026

Test-Time Computing for Referring Multimodal Large Language Models

Feb 23, 2026

SafeNeuron: Neuron-Level Safety Alignment for Large Language Models

Feb 12, 2026

MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation

Jan 13, 2026

CSMCIR: CoT-Enhanced Symmetric Alignment with Memory Bank for Composed Image Retrieval

Jan 07, 2026

Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting

Dec 31, 2025

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Dec 28, 2025

Understanding What Is Not Said: Referring Remote Sensing Image Segmentation with Scarce Expressions

Oct 26, 2025