Picture for Boyu Gou

Boyu Gou

Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Add code
Feb 19, 2025
Viaarxiv icon

Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving

Add code
Nov 11, 2024
Figure 1 for Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving
Figure 2 for Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving
Figure 3 for Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving
Figure 4 for Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving
Viaarxiv icon

Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents

Add code
Nov 10, 2024
Figure 1 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Figure 2 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Figure 3 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Figure 4 for Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
Viaarxiv icon

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Add code
Oct 07, 2024
Figure 1 for Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Figure 2 for Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Figure 3 for Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Figure 4 for Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
Viaarxiv icon

GPT-4V is a Generalist Web Agent, if Grounded

Add code
Jan 03, 2024
Viaarxiv icon