Speech Recognition


Speech recognition is the task of identifying spoken words by analyzing the speaker's voice and language, then accurately transcribing those words as text.

Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning

Apr 16, 2025
Via arXiv

Dysarthria Normalization via Local Lie Group Transformations for Robust ASR

Apr 16, 2025
Via arXiv

Real-Time Word-Level Temporal Segmentation in Streaming Speech Recognition

Apr 15, 2025
Via arXiv

Spatial Audio Processing with Large Language Model on Wearable Devices

Apr 11, 2025
Via arXiv

From Speech to Summary: A Comprehensive Survey of Speech Summarization

Apr 10, 2025
Via arXiv

Visual-Aware Speech Recognition for Noisy Scenarios

Apr 09, 2025
Via arXiv

RNN-Transducer-based Losses for Speech Recognition on Noisy Targets

Apr 09, 2025
Via arXiv

Exploring Local Interpretable Model-Agnostic Explanations for Speech Emotion Recognition with Distribution-Shift

Apr 07, 2025
Via arXiv

DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation

Apr 07, 2025
Via arXiv

LinTO Audio and Textual Datasets to Train and Evaluate Automatic Speech Recognition in Tunisian Arabic Dialect

Apr 03, 2025
Via arXiv