Picture for Abhinav Rao

Abhinav Rao

Jailbreak Paradox: The Achilles' Heel of LLMs

Add code
Jun 18, 2024
Viaarxiv icon

NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models

Add code
Apr 18, 2024
Viaarxiv icon

Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs

Add code
Oct 11, 2023
Viaarxiv icon

Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks

Add code
May 24, 2023
Viaarxiv icon

Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin

Add code
Dec 10, 2022
Viaarxiv icon