Picture for Mark Hasegawa-Johnson

Mark Hasegawa-Johnson

R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate

Add code
Oct 21, 2024
Viaarxiv icon

Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility

Add code
Sep 29, 2024
Viaarxiv icon

Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue

Add code
Sep 07, 2024
Viaarxiv icon

LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition

Add code
Aug 11, 2024
Viaarxiv icon

Sound Tagging in Infant-centric Home Soundscapes

Add code
Jun 25, 2024
Viaarxiv icon

Towards Unsupervised Speech Recognition Without Pronunciation Models

Add code
Jun 12, 2024
Viaarxiv icon

C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion

Add code
Mar 31, 2024
Viaarxiv icon

AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition

Add code
Mar 18, 2024
Viaarxiv icon

Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations

Add code
Feb 10, 2024
Viaarxiv icon

HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models

Add code
Nov 30, 2023
Viaarxiv icon