Picture for Bhavya Kailkhura

Bhavya Kailkhura

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

Add code
Mar 13, 2025
Viaarxiv icon

Constrained Language Generation with Discrete Diffusion Models

Add code
Mar 12, 2025
Viaarxiv icon

GRNFormer: A Biologically-Guided Framework for Integrating Gene Regulatory Networks into RNA Foundation Models

Add code
Mar 03, 2025
Viaarxiv icon

EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants

Add code
Feb 27, 2025
Viaarxiv icon

Extracting and Understanding the Superficial Knowledge in Alignment

Add code
Feb 07, 2025
Figure 1 for Extracting and Understanding the Superficial Knowledge in Alignment
Figure 2 for Extracting and Understanding the Superficial Knowledge in Alignment
Figure 3 for Extracting and Understanding the Superficial Knowledge in Alignment
Figure 4 for Extracting and Understanding the Superficial Knowledge in Alignment
Viaarxiv icon

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Add code
Feb 07, 2025
Viaarxiv icon

Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness

Add code
Jan 16, 2025
Viaarxiv icon

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

Add code
Jan 05, 2025
Viaarxiv icon

Active Learning Enables Extrapolation in Molecular Generative Models

Add code
Jan 03, 2025
Viaarxiv icon

Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis

Add code
Oct 12, 2024
Figure 1 for Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis
Viaarxiv icon