Picture for Kellin Pelrine

Kellin Pelrine

Epistemic Integrity in Large Language Models

Add code
Nov 10, 2024
Viaarxiv icon

A Guide to Misinformation Detection Datasets

Add code
Nov 07, 2024
Viaarxiv icon

A Simulation System Towards Solving Societal-Scale Manipulation

Add code
Oct 17, 2024
Figure 1 for A Simulation System Towards Solving Societal-Scale Manipulation
Figure 2 for A Simulation System Towards Solving Societal-Scale Manipulation
Figure 3 for A Simulation System Towards Solving Societal-Scale Manipulation
Viaarxiv icon

Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks

Add code
Aug 29, 2024
Viaarxiv icon

Scaling Laws for Data Poisoning in LLMs

Add code
Aug 06, 2024
Viaarxiv icon

Can Go AIs be adversarially robust?

Add code
Jun 18, 2024
Viaarxiv icon

Combining Confidence Elicitation and Sample-based Methods for Uncertainty Quantification in Misinformation Mitigation

Add code
Jan 30, 2024
Viaarxiv icon

Comparing GPT-4 and Open-Source Language Models in Misinformation Mitigation

Add code
Jan 12, 2024
Viaarxiv icon

Uncertainty Resolution in Misinformation Detection

Add code
Jan 02, 2024
Viaarxiv icon

Exploiting Novel GPT-4 APIs

Add code
Dec 21, 2023
Viaarxiv icon