Picture for Luxi He

Luxi He

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Add code
Dec 10, 2024
Viaarxiv icon

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Add code
Jun 26, 2024
Figure 1 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 2 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 3 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 4 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Viaarxiv icon

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

Add code
Jun 20, 2024
Viaarxiv icon

Fantastic Copyrighted Beasts and How (Not) to Generate Them

Add code
Jun 20, 2024
Viaarxiv icon

AI Risk Management Should Incorporate Both Safety and Security

Add code
May 29, 2024
Viaarxiv icon

What's in Your "Safe" Data?: Identifying Benign Data that Breaks Safety

Add code
Apr 01, 2024
Viaarxiv icon

Aleatoric and Epistemic Discrimination in Classification

Add code
Jan 27, 2023
Figure 1 for Aleatoric and Epistemic Discrimination in Classification
Figure 2 for Aleatoric and Epistemic Discrimination in Classification
Figure 3 for Aleatoric and Epistemic Discrimination in Classification
Figure 4 for Aleatoric and Epistemic Discrimination in Classification
Viaarxiv icon