Colin Leong

Enhancing Multi-Domain Automatic Short Answer Grading through an Explainable Neuro-Symbolic Pipeline

Mar 04, 2024

JWSign: A Highly Multilingual Corpus of Bible Translations for more Diversity in Sign Language Processing

Nov 16, 2023

The eBible Corpus: Data and Model Benchmarks for Bible Translation for Low-Resource Languages

Apr 19, 2023

Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages

Mar 29, 2023

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022

Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks

Oct 26, 2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

Jul 07, 2022

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

May 04, 2022

Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources

Jan 25, 2022

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Mar 22, 2021