Jiaze Li

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

Feb 03, 2026

SOMBRERO: Measuring and Steering Boundary Placement in End-to-End Hierarchical Sequence Models

Jan 30, 2026

Federated Balanced Learning

Jan 20, 2026

Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model

Jan 20, 2026

Federated Joint Learning for Domain and Class Generalization

Jan 18, 2026

Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding

Jan 16, 2026

Xiaomi MiMo-VL-Miloco Technical Report

Dec 22, 2025

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Nov 17, 2025

ImagebindDC: Compressing Multi-modal Data with Imagebind-based Condensation

Nov 11, 2025

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

May 22, 2025