Yu Meng

AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning

Dec 18, 2025
Via arXiv

Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents

Oct 06, 2025
Via arXiv

GLD-Road: A global-local decoding road network extraction model for remote sensing images

Jun 11, 2025
Via arXiv

ProxyThinker: Test-Time Guidance through Small Visual Reasoners

May 30, 2025
Via arXiv

Human in the Loop Adaptive Optimization for Improved Time Series Forecasting

May 21, 2025
Via arXiv

Do LLM Evaluators Prefer Themselves for a Reason?

Apr 04, 2025
Via arXiv

LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text

Mar 25, 2025
Via arXiv

PLPHP: Per-Layer Per-Head Vision Token Pruning for Efficient Large Vision-Language Models

Feb 20, 2025
Via arXiv

SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

Feb 17, 2025
Via arXiv

LLM Alignment as Retriever Optimization: An Information Retrieval Perspective

Feb 06, 2025
Via arXiv