Picture for John Harvill

John Harvill

Compressing Sequences in the Latent Embedding Space: $K$-Token Merging for Large Language Models

Add code
Apr 16, 2026
Viaarxiv icon

Voxtral TTS

Add code
Mar 26, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition

Add code
Aug 11, 2024
Figure 1 for LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition
Figure 2 for LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition
Figure 3 for LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition
Figure 4 for LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition
Viaarxiv icon

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

Add code
Aug 16, 2023
Figure 1 for Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction
Figure 2 for Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction
Figure 3 for Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction
Figure 4 for Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction
Viaarxiv icon

INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition

Add code
May 25, 2023
Figure 1 for INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition
Figure 2 for INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition
Figure 3 for INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition
Figure 4 for INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition
Viaarxiv icon

SPADE: Self-supervised Pretraining for Acoustic DisEntanglement

Add code
Feb 03, 2023
Figure 1 for SPADE: Self-supervised Pretraining for Acoustic DisEntanglement
Figure 2 for SPADE: Self-supervised Pretraining for Acoustic DisEntanglement
Figure 3 for SPADE: Self-supervised Pretraining for Acoustic DisEntanglement
Figure 4 for SPADE: Self-supervised Pretraining for Acoustic DisEntanglement
Viaarxiv icon

SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation

Add code
Dec 21, 2022
Figure 1 for SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
Figure 2 for SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
Figure 3 for SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
Figure 4 for SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation
Viaarxiv icon