Picture for Jiahao Qiu

Jiahao Qiu

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Add code
Dec 23, 2025
Viaarxiv icon

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Add code
Dec 21, 2025
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Figure 1 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 2 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 3 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 4 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Viaarxiv icon

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Add code
Jun 23, 2025
Figure 1 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Figure 2 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Figure 3 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Figure 4 for ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
Viaarxiv icon

AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes

Add code
Jun 17, 2025
Figure 1 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 2 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 3 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Figure 4 for AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes
Viaarxiv icon

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Add code
May 26, 2025
Viaarxiv icon

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Add code
May 26, 2025
Figure 1 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Figure 2 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Figure 3 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Figure 4 for Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution
Viaarxiv icon

Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?

Add code
May 21, 2025
Figure 1 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Figure 2 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Figure 3 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Figure 4 for Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
Viaarxiv icon

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Figure 1 for OTC: Optimal Tool Calls via Reinforcement Learning
Figure 2 for OTC: Optimal Tool Calls via Reinforcement Learning
Figure 3 for OTC: Optimal Tool Calls via Reinforcement Learning
Figure 4 for OTC: Optimal Tool Calls via Reinforcement Learning
Viaarxiv icon

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

Add code
Apr 13, 2025
Viaarxiv icon