Picture for Sahajpreet Singh

Sahajpreet Singh

Hate Personified: Investigating the role of LLMs in content moderation

Add code
Oct 03, 2024
Viaarxiv icon

Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF

Add code
Mar 15, 2024
Viaarxiv icon