Mark Hasegawa-Johnson

SiamCTC: Learning Speech Representations through Monotonic Temporal Alignment

Jun 01, 2026

Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding

May 30, 2026

PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning

May 13, 2026

Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech

Mar 16, 2026

Towards Robust Dysarthric Speech Recognition: LLM-Agent Post-ASR Correction Beyond WER

Jan 29, 2026

SICL-AT: Another way to adapt Auditory LLM to low-resource task

Jan 26, 2026

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Jan 25, 2026

TICL+: A Case Study On Speech In-Context Learning for Children's Speech Recognition

Dec 20, 2025

That's Deprecated! Understanding, Detecting, and Steering Knowledge Conflicts in Language Models for Code Generation

Oct 21, 2025

The Interspeech 2025 Speech Accessibility Project Challenge

Jul 29, 2025