Picture for Kellin Pelrine

Kellin Pelrine

The Structural Safety Generalization Problem

Add code
Apr 13, 2025
Viaarxiv icon

From Intuition to Understanding: Using AI Peers to Overcome Physics Misconceptions

Add code
Apr 01, 2025
Viaarxiv icon

Epistemic Integrity in Large Language Models

Add code
Nov 10, 2024
Viaarxiv icon

A Guide to Misinformation Detection Datasets

Add code
Nov 07, 2024
Figure 1 for A Guide to Misinformation Detection Datasets
Figure 2 for A Guide to Misinformation Detection Datasets
Figure 3 for A Guide to Misinformation Detection Datasets
Figure 4 for A Guide to Misinformation Detection Datasets
Viaarxiv icon

A Simulation System Towards Solving Societal-Scale Manipulation

Add code
Oct 17, 2024
Figure 1 for A Simulation System Towards Solving Societal-Scale Manipulation
Figure 2 for A Simulation System Towards Solving Societal-Scale Manipulation
Figure 3 for A Simulation System Towards Solving Societal-Scale Manipulation
Viaarxiv icon

Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks

Add code
Aug 29, 2024
Viaarxiv icon

Scaling Laws for Data Poisoning in LLMs

Add code
Aug 06, 2024
Viaarxiv icon

Can Go AIs be adversarially robust?

Add code
Jun 18, 2024
Viaarxiv icon

Combining Confidence Elicitation and Sample-based Methods for Uncertainty Quantification in Misinformation Mitigation

Add code
Jan 30, 2024
Viaarxiv icon

Comparing GPT-4 and Open-Source Language Models in Misinformation Mitigation

Add code
Jan 12, 2024
Viaarxiv icon