Picture for Mengdi Wang

Mengdi Wang

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Add code
Dec 06, 2024
Viaarxiv icon

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Add code
Nov 27, 2024
Figure 1 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 2 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 3 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 4 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Viaarxiv icon

One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Add code
Nov 16, 2024
Viaarxiv icon

CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR

Add code
Nov 07, 2024
Viaarxiv icon

Global Convergence in Training Large-Scale Transformers

Add code
Oct 31, 2024
Figure 1 for Global Convergence in Training Large-Scale Transformers
Figure 2 for Global Convergence in Training Large-Scale Transformers
Viaarxiv icon

A Theoretical Perspective for Speculative Decoding Algorithm

Add code
Oct 30, 2024
Viaarxiv icon

FoldMark: Protecting Protein Generative Models with Watermarking

Add code
Oct 27, 2024
Viaarxiv icon

Fast Best-of-N Decoding via Speculative Rejection

Add code
Oct 26, 2024
Viaarxiv icon

Long Term Memory: The Foundation of AI Self-Evolution

Add code
Oct 21, 2024
Viaarxiv icon

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Add code
Oct 18, 2024
Figure 1 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Figure 2 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Figure 3 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Figure 4 for TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling
Viaarxiv icon