Hengxiang Zhang

ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models

Oct 24, 2024

Defending Membership Inference Attacks via Privacy-aware Sparsity Tuning

Oct 09, 2024

Fine-tuning can Help Detect Pretraining Data from Large Language Models

Oct 09, 2024

Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond

Aug 21, 2024