Picture for Amrit Singh Bedi

Amrit Singh Bedi

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Add code
Dec 06, 2024
Viaarxiv icon

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Add code
Nov 27, 2024
Figure 1 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 2 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 3 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 4 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Viaarxiv icon

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction

Add code
Nov 01, 2024
Viaarxiv icon

EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering

Add code
Oct 26, 2024
Figure 1 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Figure 2 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Figure 3 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Figure 4 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Viaarxiv icon

On The Global Convergence Of Online RLHF With Neural Parametrization

Add code
Oct 21, 2024
Viaarxiv icon

On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning

Add code
Oct 05, 2024
Figure 1 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 2 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 3 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Figure 4 for On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning
Viaarxiv icon

AIME: AI System Optimization via Multiple LLM Evaluators

Add code
Oct 04, 2024
Viaarxiv icon

Auction-Based Regulation for Artificial Intelligence

Add code
Oct 02, 2024
Figure 1 for Auction-Based Regulation for Artificial Intelligence
Figure 2 for Auction-Based Regulation for Artificial Intelligence
Figure 3 for Auction-Based Regulation for Artificial Intelligence
Figure 4 for Auction-Based Regulation for Artificial Intelligence
Viaarxiv icon

CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

Add code
Aug 16, 2024
Figure 1 for CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk
Figure 2 for CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk
Figure 3 for CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk
Figure 4 for CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk
Viaarxiv icon

TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation

Add code
Aug 03, 2024
Figure 1 for TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation
Figure 2 for TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation
Figure 3 for TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation
Figure 4 for TrustNavGPT: Modeling Uncertainty to Improve Trustworthiness of Audio-Guided LLM-Based Robot Navigation
Viaarxiv icon