Picture for Tomasz Limisiewicz

Tomasz Limisiewicz

Dual Debiasing: Remove Stereotypes and Keep Factual Gender for Fair Language Modeling and Translation

Add code
Jan 17, 2025
Viaarxiv icon

Teaching LLMs at Charles University: Assignments and Activities

Add code
Jul 29, 2024
Viaarxiv icon

MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization

Add code
Jul 11, 2024
Figure 1 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 2 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 3 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 4 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Viaarxiv icon

MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling

Add code
Mar 15, 2024
Viaarxiv icon

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models

Add code
Jan 19, 2024
Figure 1 for Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
Figure 2 for Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
Figure 3 for Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
Figure 4 for Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models
Viaarxiv icon

Debiasing Algorithm through Model Adaptation

Add code
Oct 29, 2023
Viaarxiv icon

Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation

Add code
Sep 30, 2023
Viaarxiv icon

Tokenization Impacts Multilingual Language Modeling: Assessing Vocabulary Allocation and Overlap Across Languages

Add code
May 26, 2023
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models

Add code
Oct 13, 2022
Figure 1 for You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Figure 2 for You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Figure 3 for You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Figure 4 for You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models
Viaarxiv icon