Phu Mon Htut

Shammie

DiverseAgentEntropy: Quantifying Black-Box LLM Uncertainty through Diverse Perspectives and Multi-Agent Interaction

Dec 12, 2024

Open Domain Question Answering with Conflicting Contexts

Oct 16, 2024

RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation

May 26, 2023

(QA)$^2$: Question Answering with Questionable Assumptions

Dec 20, 2022

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022

BBQ: A Hand-Built Bias Benchmark for Question Answering

Oct 15, 2021

Comparing Test Sets with Item Response Theory

Jun 01, 2021

English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too

May 26, 2020

Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work?

May 09, 2020

jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models

Mar 04, 2020