Segev Shlomov

ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents

Oct 10, 2024

From Grounding to Planning: Benchmarking Bottlenecks in Web Agents

Sep 03, 2024

SNAP: Semantic Stories for Next Activity Prediction

Jan 28, 2024

Mimicking the Maestro: Exploring the Efficacy of a Virtual AI Teacher in Fine Motor Skill Acquisition

Oct 16, 2023

Enhancing Trust in LLM-Based AI Automation Agents: New Considerations and Future Challenges

Aug 10, 2023

Prescriptive Process Monitoring in Intelligent Process Automation with Chatbot Orchestration

Dec 13, 2022

Understanding the Properties of Generated Corpora

Jun 22, 2022

We've had this conversation before: A Novel Approach to Measuring Dialog Similarity

Oct 12, 2021

Not Enough Data? Deep Learning to the Rescue!

Nov 27, 2019