Mengdi Wang

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Jun 23, 2025
Via arXiv

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Jun 17, 2025
Via arXiv

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Jun 04, 2025
Via arXiv

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

May 26, 2025
Via arXiv

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

May 26, 2025
Via arXiv

Genome-Bench: A Scientific Reasoning Benchmark from Real-World Expert Discussions

May 26, 2025
Via arXiv

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?

May 21, 2025
Via arXiv

MMaDA: Multimodal Large Diffusion Language Models

May 21, 2025
Via arXiv

PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking

May 03, 2025
Via arXiv

WenyanGPT: A Large Language Model for Classical Chinese Tasks

Apr 29, 2025
Via arXiv