Eitan Farchi

Exploring Straightforward Conversational Red-Teaming

Sep 07, 2024
Via arXiv

Can You Trust Your Metric? Automatic Concatenation-Based Tests for Metric Validity

Aug 22, 2024
Via arXiv

A Novel Metric for Measuring the Robustness of Large Language Models in Non-adversarial Scenarios

Aug 04, 2024
Via arXiv

Generating Unseen Code Tests In Infinitum

Jul 29, 2024
Via arXiv

Using Combinatorial Optimization to Design a High quality LLM Solution

May 15, 2024
Via arXiv

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Mar 09, 2024
Via arXiv

Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations

Mar 08, 2024
Via arXiv

Unveiling Safety Vulnerabilities of Large Language Models

Nov 07, 2023
Via arXiv

Predicting Question-Answering Performance of Large Language Models through Semantic Consistency

Nov 02, 2023
Via arXiv

Characterizing how 'distributional' NLP corpora distance metrics are

Oct 23, 2023
Via arXiv