Keyu Li

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Feb 02, 2026
Via arXiv

AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts

Jan 16, 2026
Via arXiv

InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research

Nov 03, 2025
Via arXiv

Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training

Nov 03, 2025
Via arXiv

DatasetResearch: Benchmarking Agent Systems for Demand-Driven Dataset Discovery

Aug 09, 2025
Via arXiv

RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer

May 29, 2025
Via arXiv

SEM: Enhancing Spatial Understanding for Robust Robot Manipulation

May 22, 2025
Via arXiv

Deep learning automates Cobb angle measurement compared with multi-expert observers

Mar 18, 2024
Via arXiv

Causal Mediation Analysis with Multi-dimensional and Indirectly Observed Mediators

Jun 13, 2023
Via arXiv

Style Transfer Enabled Sim2Real Framework for Efficient Learning of Robotic Ultrasound Image Analysis Using Simulated Data

May 16, 2023
Via arXiv