Mihir Kale

Shammie

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

May 24, 2023
Via arXiv

CLSE: Corpus of Linguistically Significant Entities

Nov 04, 2022
Via arXiv

Mining Duplicate Questions of Stack Overflow

Oct 04, 2022
Via arXiv

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022
Via arXiv

XTREME-S: Evaluating Cross-lingual Speech Representations

Apr 13, 2022
Via arXiv

Improving Compositional Generalization with Self-Training for Data-to-Text Generation

Oct 16, 2021
Via arXiv

Using Machine Translation to Localize Task Oriented NLG Output

Jul 09, 2021
Via arXiv

Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

Jun 16, 2021
Via arXiv

nmT5 -- Is parallel data still relevant for pre-training massively multilingual language models?

Jun 03, 2021
Via arXiv

ByT5: Towards a token-free future with pre-trained byte-to-byte models

May 28, 2021
Via arXiv