Maxine Eskenazi

Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation

Jan 27, 2023
Via arXiv

The DialPort tools

Aug 18, 2022
Via arXiv

Interactive Evaluation of Dialog Track at DSTC9

Jul 28, 2022
Via arXiv

LAD: Language Models as Data for Zero-Shot Dialog

Jul 28, 2022
Via arXiv

DialCrowd 2.0: A Quality-Focused Dialog System Crowdsourcing Toolkit

Jul 25, 2022
Via arXiv

Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning

May 25, 2022
Via arXiv

Report from the NSF Future Directions Workshop on Automatic Evaluation of Dialog: Research Directions and Challenges

Mar 18, 2022
Via arXiv

A Survey of NLP-Related Crowdsourcing HITs: what works and what does not

Nov 09, 2021
Via arXiv

A Comprehensive Assessment of Dialog Evaluation Metrics

Jun 30, 2021
Via arXiv

Schema-Guided Paradigm for Zero-Shot Dialog

Jun 13, 2021
Via arXiv