Picture for Alice Li

Alice Li

On the Effects of Data Scale on Computer Control Agents

Add code
Jun 06, 2024
Viaarxiv icon

AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents

Add code
May 23, 2024
Figure 1 for AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Figure 2 for AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Figure 3 for AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Figure 4 for AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Viaarxiv icon

Dissociation of Faithful and Unfaithful Reasoning in LLMs

Add code
May 23, 2024
Figure 1 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Figure 2 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Figure 3 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Figure 4 for Dissociation of Faithful and Unfaithful Reasoning in LLMs
Viaarxiv icon

Generative AI Search Engines as Arbiters of Public Knowledge: An Audit of Bias and Authority

Add code
May 22, 2024
Viaarxiv icon

Latent State Estimation Helps UI Agents to Reason

Add code
May 17, 2024
Viaarxiv icon

Android in the Wild: A Large-Scale Dataset for Android Device Control

Add code
Jul 19, 2023
Figure 1 for Android in the Wild: A Large-Scale Dataset for Android Device Control
Figure 2 for Android in the Wild: A Large-Scale Dataset for Android Device Control
Figure 3 for Android in the Wild: A Large-Scale Dataset for Android Device Control
Figure 4 for Android in the Wild: A Large-Scale Dataset for Android Device Control
Viaarxiv icon

The 7th AI City Challenge

Add code
Apr 15, 2023
Figure 1 for The 7th AI City Challenge
Figure 2 for The 7th AI City Challenge
Figure 3 for The 7th AI City Challenge
Figure 4 for The 7th AI City Challenge
Viaarxiv icon

Productivity Assessment of Neural Code Completion

Add code
May 13, 2022
Figure 1 for Productivity Assessment of Neural Code Completion
Figure 2 for Productivity Assessment of Neural Code Completion
Figure 3 for Productivity Assessment of Neural Code Completion
Figure 4 for Productivity Assessment of Neural Code Completion
Viaarxiv icon

The 6th AI City Challenge

Add code
Apr 21, 2022
Figure 1 for The 6th AI City Challenge
Figure 2 for The 6th AI City Challenge
Figure 3 for The 6th AI City Challenge
Figure 4 for The 6th AI City Challenge
Viaarxiv icon