Honglin Guo

OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

Jan 16, 2026
Via arXiv

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Jan 16, 2026
Via arXiv

Memory in the Age of AI Agents

Dec 15, 2025
Via arXiv

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Sep 10, 2025
Via arXiv

Organ-Agents: Virtual Human Physiology Simulator via LLMs

Aug 20, 2025
Via arXiv

TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models

Mar 10, 2025
Via arXiv

Better Process Supervision with Bi-directional Rewarding Signals

Mar 06, 2025
Via arXiv

CritiQ: Mining Data Quality Criteria from Human Preferences

Feb 26, 2025
Via arXiv

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Jun 06, 2024
Via arXiv

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

Feb 20, 2024
Via arXiv