Rebecca J. Passonneau

Bellcore

Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling

Jan 05, 2026

Concept-based Rubrics Improve LLM Formative Assessment and Data Synthesis

Apr 04, 2025

Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet

Feb 07, 2025

Joint Training for Selective Prediction

Oct 31, 2024

Improving Model Evaluation using SMART Filtering of Benchmark Datasets

Oct 26, 2024

How Well Can You Articulate that Idea? Insights from Automated Formative Assessment

Apr 17, 2024

VerAs: Verify then Assess STEM Lab Reports

Feb 07, 2024

The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis

Oct 18, 2023

CALM: A Multi-task Benchmark for Comprehensive Assessment of Language Model Bias

Aug 24, 2023

Survey on Sociodemographic Bias in Natural Language Processing

Jun 27, 2023