Picture for Liang Ding

Liang Ding

IndustryBench-MIPU: Benchmarking Multi-Image Attribute Value Extraction for Industrial Products

Add code
Jun 12, 2026
Viaarxiv icon

ARBOR: Online Process Rewards via a Reusable Rubric Buffer for Search Agents

Add code
Jun 02, 2026
Viaarxiv icon

Better, Faster: Harnessing Self-Improvement in Large Reasoning Models

Add code
May 24, 2026
Viaarxiv icon

Learn to Think: Improving Multimodal Reasoning through Vision-Aware Self-Improvement Training

Add code
May 12, 2026
Viaarxiv icon

IndustryBench: Probing the Industrial Knowledge Boundaries of LLMs

Add code
May 11, 2026
Viaarxiv icon

A Multimodal Dataset for Visually Grounded Ambiguity in Machine Translation

Add code
May 03, 2026
Viaarxiv icon

Universally Empowering Zeroth-Order Optimization via Adaptive Layer-wise Sampling

Add code
Apr 20, 2026
Viaarxiv icon

AgentHER: Hindsight Experience Replay for LLM Agent Trajectory Relabeling

Add code
Mar 22, 2026
Viaarxiv icon

AdaRubric: Task-Adaptive Rubrics for LLM Agent Evaluation

Add code
Mar 22, 2026
Viaarxiv icon

AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

Add code
Feb 26, 2026
Viaarxiv icon