Rebecca J. Passonneau

Bellcore

Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling

Jan 05, 2026
Via arXiv

Concept-based Rubrics Improve LLM Formative Assessment and Data Synthesis

Apr 04, 2025
Via arXiv

Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet

Feb 07, 2025
Via arXiv

Joint Training for Selective Prediction

Oct 31, 2024
Via arXiv

Improving Model Evaluation using SMART Filtering of Benchmark Datasets

Oct 26, 2024
Via arXiv

How Well Can You Articulate that Idea? Insights from Automated Formative Assessment

Apr 17, 2024
Via arXiv

VerAs: Verify then Assess STEM Lab Reports

Feb 07, 2024
Via arXiv

The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis

Oct 18, 2023
Via arXiv

CALM: A Multi-task Benchmark for Comprehensive Assessment of Language Model Bias

Aug 24, 2023
Via arXiv

Survey on Sociodemographic Bias in Natural Language Processing

Jun 27, 2023
Via arXiv