Picture for Alexander Waibel

Alexander Waibel

MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models

Add code
Nov 27, 2024
Viaarxiv icon

Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS

Add code
Oct 19, 2024
Viaarxiv icon

Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck

Add code
Oct 15, 2024
Viaarxiv icon

Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems

Add code
Sep 30, 2024
Figure 1 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Figure 2 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Figure 3 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Figure 4 for Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems
Viaarxiv icon

Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages

Add code
Aug 05, 2024
Figure 1 for Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Figure 2 for Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Figure 3 for Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Figure 4 for Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Viaarxiv icon

Handling Numeric Expressions in Automatic Speech Recognition

Add code
Jul 18, 2024
Viaarxiv icon

Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024

Add code
Jun 24, 2024
Viaarxiv icon

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading

Add code
Jun 14, 2024
Figure 1 for SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
Figure 2 for SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
Figure 3 for SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
Figure 4 for SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
Viaarxiv icon

Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation

Add code
May 07, 2024
Viaarxiv icon

From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions

Add code
Feb 27, 2024
Viaarxiv icon