Picture for Andrew Caines

Andrew Caines

Bias Dynamics in BabyLMs: Towards a Compute-Efficient Sandbox for Democratising Pre-Training Debiasing

Add code
Jan 15, 2026
Viaarxiv icon

Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction

Add code
Oct 23, 2025
Viaarxiv icon

DACTYL: Diverse Adversarial Corpus of Texts Yielded from Large Language Models

Add code
Aug 01, 2025
Viaarxiv icon

AFRIDOC-MT: Document-level MT Corpus for African Languages

Add code
Jan 10, 2025
Figure 1 for AFRIDOC-MT: Document-level MT Corpus for African Languages
Figure 2 for AFRIDOC-MT: Document-level MT Corpus for African Languages
Figure 3 for AFRIDOC-MT: Document-level MT Corpus for African Languages
Figure 4 for AFRIDOC-MT: Document-level MT Corpus for African Languages
Viaarxiv icon

From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes

Add code
Oct 30, 2024
Figure 1 for From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Figure 2 for From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Figure 3 for From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Figure 4 for From Babble to Words: Pre-Training Language Models on Continuous Streams of Phonemes
Viaarxiv icon

Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing

Add code
Oct 15, 2024
Figure 1 for Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing
Figure 2 for Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing
Figure 3 for Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing
Figure 4 for Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing
Viaarxiv icon

Grammatical Error Correction for Code-Switched Sentences by Learners of English

Add code
Apr 18, 2024
Viaarxiv icon

Prompting open-source and commercial language models for grammatical error correction of English learner text

Add code
Jan 15, 2024
Figure 1 for Prompting open-source and commercial language models for grammatical error correction of English learner text
Figure 2 for Prompting open-source and commercial language models for grammatical error correction of English learner text
Figure 3 for Prompting open-source and commercial language models for grammatical error correction of English learner text
Figure 4 for Prompting open-source and commercial language models for grammatical error correction of English learner text
Viaarxiv icon

CLIMB: Curriculum Learning for Infant-inspired Model Building

Add code
Nov 15, 2023
Viaarxiv icon

On the application of Large Language Models for language teaching and assessment technology

Add code
Jul 17, 2023
Viaarxiv icon