Leon Derczynski

Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities

Jan 31, 2025
Via arXiv

Nemotron-4 340B Technical Report

Jun 17, 2024
Via arXiv

garak: A Framework for Security Probing Large Language Models

Jun 16, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild

Nov 13, 2023
Via arXiv

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

Jun 29, 2023
Via arXiv

Assessing Language Model Deployment with Risk Cards

Mar 31, 2023
Via arXiv

Training a T5 Using Lab-sized Resources

Aug 25, 2022
Via arXiv

Sparse Probability of Agreement

Aug 12, 2022
Via arXiv

The ITU Faroese Pairs Dataset

Jun 17, 2022
Via arXiv