Ming-Ming Cheng

Nankai University

Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models

Mar 21, 2026
Via arXiv

Mixture of Style Experts for Diverse Image Stylization

Mar 17, 2026
Via arXiv

Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining

Mar 02, 2026
Via arXiv

Test-Time Computing for Referring Multimodal Large Language Models

Feb 23, 2026
Via arXiv

Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

Feb 13, 2026
Via arXiv

FlowConsist: Make Your Flow Consistent with Real Trajectory

Feb 06, 2026
Via arXiv

Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Feb 03, 2026
Via arXiv

Predictive Sample Assignment for Semantically Coherent Out-of-Distribution Detection

Dec 15, 2025
Via arXiv

Sharpness-aware Dynamic Anchor Selection for Generalized Category Discovery

Dec 15, 2025
Via arXiv

OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation

Sep 18, 2025
Via arXiv