Bhavya Kailkhura

Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness

Jan 16, 2025
Via arXiv

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

Jan 05, 2025
Via arXiv

Active Learning Enables Extrapolation in Molecular Generative Models

Jan 03, 2025
Via arXiv

Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis

Oct 12, 2024
Via arXiv

Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

Aug 10, 2024
Via arXiv

ELFS: Enhancing Label-Free Coreset Selection via Clustering-based Pseudo-Labeling

Jun 06, 2024
Via arXiv

Low-rank finetuning for LLMs: A fairness perspective

May 28, 2024
Via arXiv

Transformers Can Do Arithmetic with the Right Embeddings

May 27, 2024
Via arXiv

SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

Apr 28, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv