Picture for Brian Roark

Brian Roark

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

Add code
May 24, 2023
Figure 1 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 2 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 3 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 4 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Viaarxiv icon

Spelling convention sensitivity in neural language models

Add code
Mar 06, 2023
Viaarxiv icon

Beyond Arabic: Software for Perso-Arabic Script Manipulation

Add code
Jan 26, 2023
Viaarxiv icon

Structured abbreviation expansion in context

Add code
Oct 04, 2021
Figure 1 for Structured abbreviation expansion in context
Figure 2 for Structured abbreviation expansion in context
Figure 3 for Structured abbreviation expansion in context
Figure 4 for Structured abbreviation expansion in context
Viaarxiv icon

Finding Concept-specific Biases in Form--Meaning Associations

Add code
Apr 29, 2021
Figure 1 for Finding Concept-specific Biases in Form--Meaning Associations
Figure 2 for Finding Concept-specific Biases in Form--Meaning Associations
Figure 3 for Finding Concept-specific Biases in Form--Meaning Associations
Figure 4 for Finding Concept-specific Biases in Form--Meaning Associations
Viaarxiv icon

Disambiguatory Signals are Stronger in Word-initial Positions

Add code
Feb 03, 2021
Figure 1 for Disambiguatory Signals are Stronger in Word-initial Positions
Viaarxiv icon

Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset

Add code
Jul 02, 2020
Figure 1 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 2 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 3 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Figure 4 for Processing South Asian Languages Written in the Latin Script: the Dakshina Dataset
Viaarxiv icon

Phonotactic Complexity and its Trade-offs

Add code
May 07, 2020
Viaarxiv icon

Language-agnostic Multilingual Modeling

Add code
Apr 20, 2020
Figure 1 for Language-agnostic Multilingual Modeling
Figure 2 for Language-agnostic Multilingual Modeling
Figure 3 for Language-agnostic Multilingual Modeling
Figure 4 for Language-agnostic Multilingual Modeling
Viaarxiv icon

Meaning to Form: Measuring Systematicity as Information

Add code
Jul 26, 2019
Figure 1 for Meaning to Form: Measuring Systematicity as Information
Figure 2 for Meaning to Form: Measuring Systematicity as Information
Figure 3 for Meaning to Form: Measuring Systematicity as Information
Figure 4 for Meaning to Form: Measuring Systematicity as Information
Viaarxiv icon