Eugene Jang

Improbable Bigrams Expose Vulnerabilities of Incomplete Tokens in Byte-Level Tokenizers

Oct 31, 2024
Via arXiv

Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain

Mar 15, 2024
Via arXiv

WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models

Jun 26, 2023
Via arXiv

DarkBERT: A Language Model for the Dark Side of the Internet

May 18, 2023
Via arXiv

Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models

Jun 23, 2022
Via arXiv

Shedding New Light on the Language of the Dark Web

Apr 14, 2022
Via arXiv