Picture for Kuan Li

Kuan Li

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes

Add code
Jun 01, 2026
Viaarxiv icon

HomeFlow: A Data Flywheel for Smart Home Agent Training with Verifiable Simulation

Add code
May 31, 2026
Viaarxiv icon

EvoCode-Bench: Evaluating Coding Agents in Multi-Turn Iterative Interactions

Add code
May 22, 2026
Viaarxiv icon

Route Before Retrieve: Activating Latent Routing Abilities of LLMs for RAG vs. Long-Context Selection

Add code
May 11, 2026
Viaarxiv icon

BabyVision: Visual Reasoning Beyond Language

Add code
Jan 10, 2026
Viaarxiv icon

Nested Browser-Use Learning for Agentic Information Seeking

Add code
Dec 29, 2025
Viaarxiv icon

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Add code
Nov 10, 2025
Viaarxiv icon

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization

Add code
Sep 16, 2025
Figure 1 for ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
Figure 2 for ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
Figure 3 for ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
Figure 4 for ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
Viaarxiv icon

Scaling Agents via Continual Pre-training

Add code
Sep 16, 2025
Viaarxiv icon

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Add code
Sep 16, 2025
Figure 1 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 2 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 3 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Figure 4 for WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Viaarxiv icon