Picture for Meng Wang

Meng Wang

School of Electronic and Information Engineering Liaoning Technical University Xingcheng City, Liaoning Province, P. R. China

VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?

Add code
Dec 23, 2025
Figure 1 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Figure 2 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Figure 3 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Figure 4 for VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
Viaarxiv icon

Kling-Omni Technical Report

Add code
Dec 18, 2025
Figure 1 for Kling-Omni Technical Report
Figure 2 for Kling-Omni Technical Report
Figure 3 for Kling-Omni Technical Report
Figure 4 for Kling-Omni Technical Report
Viaarxiv icon

Cross-modal Fundus Image Registration under Large FoV Disparity

Add code
Dec 14, 2025
Viaarxiv icon

Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning

Add code
Dec 14, 2025
Viaarxiv icon

FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion

Add code
Dec 12, 2025
Viaarxiv icon

Mitigating Recommendation Biases via Group-Alignment and Global-Uniformity in Representation Learning

Add code
Nov 17, 2025
Viaarxiv icon

Towards Non-Stationary Time Series Forecasting with Temporal Stabilization and Frequency Differencing

Add code
Nov 17, 2025
Viaarxiv icon

Disentangling Emotional Bases and Transient Fluctuations: A Low-Rank Sparse Decomposition Approach for Video Affective Analysis

Add code
Nov 14, 2025
Viaarxiv icon

Mamba-driven multi-perspective structural understanding for molecular ground-state conformation prediction

Add code
Nov 10, 2025
Viaarxiv icon

GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping

Add code
Oct 25, 2025
Viaarxiv icon