Picture for Rahul Gupta

Rahul Gupta

Amazon Nova AI Challenge -- Trusted AI: Advancing secure, AI-assisted software development

Add code
Aug 13, 2025
Viaarxiv icon

Retrieval-Augmented Multi-Agent System for Rapid Statement of Work Generation

Add code
Aug 11, 2025
Viaarxiv icon

Customize Multi-modal RAI Guardrails with Precedent-based predictions

Add code
Jul 28, 2025
Viaarxiv icon

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Add code
Jul 03, 2025
Viaarxiv icon

Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation

Add code
May 27, 2025
Viaarxiv icon

Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning

Add code
Apr 02, 2025
Viaarxiv icon

Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench

Add code
Apr 01, 2025
Viaarxiv icon

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

Add code
Mar 30, 2025
Viaarxiv icon

LUME: LLM Unlearning with Multitask Evaluations

Add code
Feb 20, 2025
Figure 1 for LUME: LLM Unlearning with Multitask Evaluations
Figure 2 for LUME: LLM Unlearning with Multitask Evaluations
Figure 3 for LUME: LLM Unlearning with Multitask Evaluations
Figure 4 for LUME: LLM Unlearning with Multitask Evaluations
Viaarxiv icon

Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification

Add code
Oct 07, 2024
Figure 1 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 2 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 3 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 4 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Viaarxiv icon