Picture for Mona Diab

Mona Diab

Carnegie Mellon University

Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models

Add code
Nov 01, 2024
Figure 1 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Figure 2 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Figure 3 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Figure 4 for Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Viaarxiv icon

BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data

Add code
Oct 21, 2024
Figure 1 for BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data
Figure 2 for BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data
Figure 3 for BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data
Figure 4 for BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data
Viaarxiv icon

The FIGNEWS Shared Task on News Media Narratives

Add code
Jul 25, 2024
Figure 1 for The FIGNEWS Shared Task on News Media Narratives
Figure 2 for The FIGNEWS Shared Task on News Media Narratives
Figure 3 for The FIGNEWS Shared Task on News Media Narratives
Figure 4 for The FIGNEWS Shared Task on News Media Narratives
Viaarxiv icon

Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

Add code
Jun 25, 2024
Viaarxiv icon

Evaluating Large Language Model Biases in Persona-Steered Generation

Add code
May 30, 2024
Viaarxiv icon

Automatic Generation of Model and Data Cards: A Step Towards Responsible AI

Add code
May 10, 2024
Viaarxiv icon

Analyzing the Role of Semantic Representations in the Era of Large Language Models

Add code
May 02, 2024
Viaarxiv icon

Emotion Classification in Low and Moderate Resource Languages

Add code
Feb 28, 2024
Viaarxiv icon

Investigating Cultural Alignment of Large Language Models

Add code
Feb 20, 2024
Figure 1 for Investigating Cultural Alignment of Large Language Models
Figure 2 for Investigating Cultural Alignment of Large Language Models
Figure 3 for Investigating Cultural Alignment of Large Language Models
Figure 4 for Investigating Cultural Alignment of Large Language Models
Viaarxiv icon

A Note on Bias to Complete

Add code
Feb 18, 2024
Viaarxiv icon