Picture for Ateret Anaby-Tavor

Ateret Anaby-Tavor

Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In

Add code
Oct 22, 2024
Viaarxiv icon

Exploring Straightforward Conversational Red-Teaming

Add code
Sep 07, 2024
Viaarxiv icon

A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios

Add code
Aug 04, 2024
Viaarxiv icon

From Zero to Hero: Cold-Start Anomaly Detection

Add code
May 30, 2024
Viaarxiv icon

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Add code
Mar 09, 2024
Viaarxiv icon

What's the Plan? Evaluating and Developing Planning-Aware Techniques for LLMs

Add code
Feb 18, 2024
Viaarxiv icon

SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models

Add code
Feb 18, 2024
Viaarxiv icon

Unveiling Safety Vulnerabilities of Large Language Models

Add code
Nov 07, 2023
Viaarxiv icon

Predicting Question-Answering Performance of Large Language Models through Semantic Consistency

Add code
Nov 02, 2023
Viaarxiv icon

Reliable and Interpretable Drift Detection in Streams of Short Texts

Add code
May 28, 2023
Viaarxiv icon