Picture for Jingbo Zhu

Jingbo Zhu

MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning

Add code
Mar 26, 2026
Viaarxiv icon

PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction

Add code
Mar 20, 2026
Viaarxiv icon

DaPT: A Dual-Path Framework for Multilingual Multi-hop Question Answering

Add code
Mar 19, 2026
Viaarxiv icon

On the Emotion Understanding of Synthesized Speech

Add code
Mar 17, 2026
Viaarxiv icon

StyleBench: Evaluating Speech Language Models on Conversational Speaking Style Control

Add code
Mar 08, 2026
Viaarxiv icon

When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

Add code
Feb 28, 2026
Viaarxiv icon

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling

Add code
Feb 02, 2026
Viaarxiv icon

APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards

Add code
Jan 31, 2026
Viaarxiv icon

SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams

Add code
Jan 14, 2026
Viaarxiv icon

Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models

Add code
Nov 16, 2025
Viaarxiv icon