Picture for Peng Ye

Peng Ye

Can Multimodal Large Language Models Truly Understand Small Objects?

Add code
Apr 24, 2026
Viaarxiv icon

MLG-Stereo: ViT Based Stereo Matching with Multi-Stage Local-Global Enhancement

Add code
Apr 22, 2026
Viaarxiv icon

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Add code
Feb 10, 2026
Viaarxiv icon

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Add code
Feb 09, 2026
Viaarxiv icon

MARTI-MARS$^2$: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation

Add code
Feb 08, 2026
Viaarxiv icon

FreshMem: Brain-Inspired Frequency-Space Hybrid Memory for Streaming Video Understanding

Add code
Feb 02, 2026
Viaarxiv icon

A Unified Study of LoRA Variants: Taxonomy, Review, Codebase, and Empirical Evaluation

Add code
Jan 30, 2026
Viaarxiv icon

Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

Add code
Jan 30, 2026
Viaarxiv icon

FRISM: Fine-Grained Reasoning Injection via Subspace-Level Model Merging for Vision-Language Models

Add code
Jan 29, 2026
Viaarxiv icon

LSTM-MAS: A Long Short-Term Memory Inspired Multi-Agent System for Long-Context Understanding

Add code
Jan 17, 2026
Viaarxiv icon