Picture for Ben Wiesel

Ben Wiesel

ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents

Add code
Oct 10, 2024
Figure 1 for ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
Figure 2 for ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
Figure 3 for ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
Figure 4 for ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents
Viaarxiv icon