Picture for Jianzhong Ju

Jianzhong Ju

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

Add code
Feb 03, 2026
Viaarxiv icon

Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Federated Balanced Learning

Add code
Jan 20, 2026
Viaarxiv icon

Federated Joint Learning for Domain and Class Generalization

Add code
Jan 18, 2026
Viaarxiv icon

Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding

Add code
Jan 16, 2026
Viaarxiv icon

Xiaomi MiMo-VL-Miloco Technical Report

Add code
Dec 22, 2025
Figure 1 for Xiaomi MiMo-VL-Miloco Technical Report
Figure 2 for Xiaomi MiMo-VL-Miloco Technical Report
Figure 3 for Xiaomi MiMo-VL-Miloco Technical Report
Figure 4 for Xiaomi MiMo-VL-Miloco Technical Report
Viaarxiv icon

REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding

Add code
Nov 17, 2025
Viaarxiv icon

Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

Add code
Aug 07, 2025
Viaarxiv icon

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Add code
May 22, 2025
Figure 1 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Figure 2 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Figure 3 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Figure 4 for Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
Viaarxiv icon

Direction-Aware Diagonal Autoregressive Image Generation

Add code
Mar 14, 2025
Viaarxiv icon