Jianfeng Chi

Jack

Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations

Nov 15, 2024
Via arXiv

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

Oct 23, 2024
Via arXiv

Persistent Pre-Training Poisoning of LLMs

Oct 17, 2024
Via arXiv

Backtracking Improves Generation Safety

Sep 22, 2024
Via arXiv

BadMerging: Backdoor Attacks Against Model Merging

Aug 14, 2024
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction

Jun 10, 2024
Via arXiv

Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations

Dec 07, 2023
Via arXiv

Where have you been? A Study of Privacy Risk for Point-of-Interest Recommendation

Oct 28, 2023
Via arXiv

FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods

Jun 15, 2023
Via arXiv