Picture for Rahul Gupta

Rahul Gupta

Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models

Add code
Oct 07, 2024
Figure 1 for Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
Figure 2 for Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
Figure 3 for Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
Figure 4 for Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models
Viaarxiv icon

Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification

Add code
Oct 07, 2024
Figure 1 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 2 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 3 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Figure 4 for Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification
Viaarxiv icon

Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs

Add code
Jul 31, 2024
Viaarxiv icon

Quantitative Certification of Bias in Large Language Models

Add code
May 29, 2024
Viaarxiv icon

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality

Add code
Apr 27, 2024
Viaarxiv icon

Toward Informal Language Processing: Knowledge of Slang in Large Language Models

Add code
Apr 13, 2024
Figure 1 for Toward Informal Language Processing: Knowledge of Slang in Large Language Models
Figure 2 for Toward Informal Language Processing: Knowledge of Slang in Large Language Models
Figure 3 for Toward Informal Language Processing: Knowledge of Slang in Large Language Models
Figure 4 for Toward Informal Language Processing: Knowledge of Slang in Large Language Models
Viaarxiv icon

Partial Federated Learning

Add code
Mar 03, 2024
Viaarxiv icon

Are you talking to or ? On Tokenization and Addressing Misgendering in LLMs with Pronoun Tokenization Parity

Add code
Dec 21, 2023
Viaarxiv icon

Faithful Model Evaluation for Model-Based Metrics

Add code
Dec 19, 2023
Viaarxiv icon

JAB: Joint Adversarial Prompting and Belief Augmentation

Add code
Nov 16, 2023
Viaarxiv icon