Picture for Hongru Wang

Hongru Wang

Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration

Add code
Feb 03, 2026
Viaarxiv icon

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

Add code
Feb 02, 2026
Viaarxiv icon

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents

Add code
Jan 29, 2026
Viaarxiv icon

Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency

Add code
Jan 09, 2026
Viaarxiv icon

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Add code
Dec 21, 2025
Viaarxiv icon

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Add code
Oct 16, 2025
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Figure 1 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 2 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 3 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 4 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Viaarxiv icon

Perception-Aware Policy Optimization for Multimodal Reasoning

Add code
Jul 08, 2025
Figure 1 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 2 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 3 for Perception-Aware Policy Optimization for Multimodal Reasoning
Figure 4 for Perception-Aware Policy Optimization for Multimodal Reasoning
Viaarxiv icon

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Add code
Jun 17, 2025
Figure 1 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 2 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 3 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 4 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Viaarxiv icon

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

Add code
May 27, 2025
Viaarxiv icon