Picture for George Kour

George Kour

Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In

Add code
Oct 22, 2024
Figure 1 for Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
Figure 2 for Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
Figure 3 for Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
Figure 4 for Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In
Viaarxiv icon

Exploring Straightforward Conversational Red-Teaming

Add code
Sep 07, 2024
Figure 1 for Exploring Straightforward Conversational Red-Teaming
Figure 2 for Exploring Straightforward Conversational Red-Teaming
Figure 3 for Exploring Straightforward Conversational Red-Teaming
Figure 4 for Exploring Straightforward Conversational Red-Teaming
Viaarxiv icon

Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity

Add code
Aug 22, 2024
Figure 1 for Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity
Figure 2 for Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity
Figure 3 for Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity
Figure 4 for Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity
Viaarxiv icon

From Zero to Hero: Cold-Start Anomaly Detection

Add code
May 30, 2024
Figure 1 for From Zero to Hero: Cold-Start Anomaly Detection
Figure 2 for From Zero to Hero: Cold-Start Anomaly Detection
Figure 3 for From Zero to Hero: Cold-Start Anomaly Detection
Figure 4 for From Zero to Hero: Cold-Start Anomaly Detection
Viaarxiv icon

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Add code
Mar 09, 2024
Figure 1 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Figure 2 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Figure 3 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Figure 4 for Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations
Viaarxiv icon

Unveiling Safety Vulnerabilities of Large Language Models

Add code
Nov 07, 2023
Figure 1 for Unveiling Safety Vulnerabilities of Large Language Models
Figure 2 for Unveiling Safety Vulnerabilities of Large Language Models
Figure 3 for Unveiling Safety Vulnerabilities of Large Language Models
Figure 4 for Unveiling Safety Vulnerabilities of Large Language Models
Viaarxiv icon

Characterizing how 'distributional' NLP corpora distance metrics are

Add code
Oct 23, 2023
Viaarxiv icon

Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

Add code
Nov 29, 2022
Viaarxiv icon

Understanding the Properties of Generated Corpora

Add code
Jun 22, 2022
Figure 1 for Understanding the Properties of Generated Corpora
Figure 2 for Understanding the Properties of Generated Corpora
Figure 3 for Understanding the Properties of Generated Corpora
Figure 4 for Understanding the Properties of Generated Corpora
Viaarxiv icon

Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

Add code
Dec 22, 2021
Figure 1 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Figure 2 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Figure 3 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Figure 4 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Viaarxiv icon