Tianlong Li

Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

Sep 25, 2024
Via arXiv

Towards Biologically Plausible Computing: A Comprehensive Comparison

Jun 23, 2024
Via arXiv

Promoting Data and Model Privacy in Federated Learning through Quantized LoRA

Jun 16, 2024
Via arXiv

MetaRM: Shifted Distributions Alignment via Meta-Learning

May 01, 2024
Via arXiv

Advancing Parameter Efficiency in Fine-tuning via Representation Editing

Feb 28, 2024
Via arXiv

Open the Pandora's Box of LLMs: Jailbreaking LLMs through Representation Engineering

Jan 12, 2024
Via arXiv

Aligning Large Language Models with Human Preferences through Representation Engineering

Dec 26, 2023
Via arXiv

Tailoring Personality Traits in Large Language Models via Unsupervisedly-Built Personalized Lexicons

Oct 25, 2023
Via arXiv

SpikeCLIP: A Contrastive Language-Image Pretrained Spiking Neural Network

Oct 12, 2023
Via arXiv

SpikeBERT: A Language Spikformer Trained with Two-Stage Knowledge Distillation from BERT

Aug 30, 2023
Via arXiv