Picture for Sabrina J. Mielke

Sabrina J. Mielke

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

UniMorph 4.0: Universal Morphology

Add code
May 10, 2022
Figure 1 for UniMorph 4.0: Universal Morphology
Figure 2 for UniMorph 4.0: Universal Morphology
Figure 3 for UniMorph 4.0: Universal Morphology
Figure 4 for UniMorph 4.0: Universal Morphology
Viaarxiv icon

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

Add code
Dec 20, 2021
Figure 1 for Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Viaarxiv icon

SIGTYP 2021 Shared Task: Robust Spoken Language Identification

Add code
Jun 07, 2021
Figure 1 for SIGTYP 2021 Shared Task: Robust Spoken Language Identification
Figure 2 for SIGTYP 2021 Shared Task: Robust Spoken Language Identification
Figure 3 for SIGTYP 2021 Shared Task: Robust Spoken Language Identification
Figure 4 for SIGTYP 2021 Shared Task: Robust Spoken Language Identification
Viaarxiv icon

Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness

Add code
Dec 30, 2020
Figure 1 for Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness
Figure 2 for Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness
Figure 3 for Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness
Figure 4 for Linguistic calibration through metacognition: aligning dialogue agent responses with expected correctness
Viaarxiv icon

SIGTYP 2020 Shared Task: Prediction of Typological Features

Add code
Oct 26, 2020
Figure 1 for SIGTYP 2020 Shared Task: Prediction of Typological Features
Figure 2 for SIGTYP 2020 Shared Task: Prediction of Typological Features
Figure 3 for SIGTYP 2020 Shared Task: Prediction of Typological Features
Figure 4 for SIGTYP 2020 Shared Task: Prediction of Typological Features
Viaarxiv icon

SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection

Add code
Jul 14, 2020
Figure 1 for SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
Figure 2 for SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
Figure 3 for SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
Figure 4 for SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
Viaarxiv icon

Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset

Add code
Jul 02, 2020
Figure 1 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 2 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 3 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 4 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Viaarxiv icon

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Add code
May 17, 2020
Figure 1 for It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
Figure 2 for It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
Figure 3 for It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
Figure 4 for It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
Viaarxiv icon

Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!

Add code
Apr 30, 2020
Figure 1 for Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!
Figure 2 for Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!
Figure 3 for Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!
Figure 4 for Tired of Topic Models? Clusters of Pretrained Word Embeddings Make for Fast and Good Topics too!
Viaarxiv icon