Picture for Sarthak Roy

Sarthak Roy

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

Add code
Jun 27, 2024
Viaarxiv icon

Probing LLMs for hate speech detection: strengths and vulnerabilities

Add code
Oct 28, 2023
Figure 1 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Figure 2 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Figure 3 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Figure 4 for Probing LLMs for hate speech detection: strengths and vulnerabilities
Viaarxiv icon