Wei Ji

TexEditor: Structure-Preserving Text-Driven Texture Editing

Mar 19, 2026
Via arXiv

Selective Noise Suppression and Discriminative Mutual Interaction for Robust Audio-Visual Segmentation

Mar 15, 2026
Via arXiv

UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark

Mar 05, 2026
Via arXiv

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Feb 11, 2026
Via arXiv

Interp3D: Correspondence-aware Interpolation for Generative Textured 3D Morphing

Jan 20, 2026
Via arXiv

STEP3-VL-10B Technical Report

Jan 15, 2026
Via arXiv

Learning Multi-Modal Mobility Dynamics for Generalized Next Location Recommendation

Dec 27, 2025
Via arXiv

Surgical Scene Segmentation using a Spike-Driven Video Transformer with Real-Time Potential

Dec 24, 2025
Via arXiv

Step-DeepResearch Technical Report

Dec 24, 2025
Via arXiv

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Nov 13, 2025
Via arXiv