Picture for Sahajpreet Singh

Sahajpreet Singh

Hate Personified: Investigating the role of LLMs in content moderation

Add code
Oct 03, 2024
Figure 1 for Hate Personified: Investigating the role of LLMs in content moderation
Figure 2 for Hate Personified: Investigating the role of LLMs in content moderation
Figure 3 for Hate Personified: Investigating the role of LLMs in content moderation
Figure 4 for Hate Personified: Investigating the role of LLMs in content moderation
Viaarxiv icon

Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF

Add code
Mar 15, 2024
Viaarxiv icon