Skyler Seto

Multilingual Knowledge Transfer under Data Constraints via Lexical Interventions

May 22, 2026
Via arXiv

Mix, Don't Tune: Bilingual Pre-Training Outperforms Hyperparameter Search in Data-Constrained Settings

May 13, 2026
Via arXiv

Scaling Laws for Mixture Pretraining Under Data Constraints

May 12, 2026
Via arXiv

Optimal Splitting of Language Models from Mixtures to Specialized Domains

Mar 19, 2026
Via arXiv

Assessing the Role of Data Quality in Training Bilingual Language Models

Jun 15, 2025
Via arXiv

Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting

May 30, 2025
Via arXiv

Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks

May 23, 2025
Via arXiv

Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models

Feb 21, 2025
Via arXiv

Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans

Feb 20, 2025
Via arXiv

Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging

Feb 03, 2025
Via arXiv