Picture for George Kour

George Kour

Breaking ReAct Agents: Foot-in-the-Door Attack Will Get You In

Add code
Oct 22, 2024
Viaarxiv icon

Exploring Straightforward Conversational Red-Teaming

Add code
Sep 07, 2024
Viaarxiv icon

Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity

Add code
Aug 22, 2024
Viaarxiv icon

From Zero to Hero: Cold-Start Anomaly Detection

Add code
May 30, 2024
Viaarxiv icon

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Add code
Mar 09, 2024
Viaarxiv icon

Unveiling Safety Vulnerabilities of Large Language Models

Add code
Nov 07, 2023
Viaarxiv icon

Characterizing how 'distributional' NLP corpora distance metrics are

Add code
Oct 23, 2023
Viaarxiv icon

Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

Add code
Nov 29, 2022
Viaarxiv icon

Understanding the Properties of Generated Corpora

Add code
Jun 22, 2022
Figure 1 for Understanding the Properties of Generated Corpora
Figure 2 for Understanding the Properties of Generated Corpora
Figure 3 for Understanding the Properties of Generated Corpora
Figure 4 for Understanding the Properties of Generated Corpora
Viaarxiv icon

Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

Add code
Dec 22, 2021
Figure 1 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Figure 2 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Figure 3 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Figure 4 for Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation
Viaarxiv icon