Parisa Haghani

Schema Augmentation for Zero-Shot Domain Adaptation in Dialogue State Tracking

Oct 31, 2024
Via arXiv

ASTRA: Aligning Speech and Text Representations for Asr without Sampling

Jun 10, 2024
Via arXiv

Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition

Oct 17, 2023
Via arXiv

Using Text Injection to Improve Recognition of Personal Identifiers in Speech

Aug 14, 2023
Via arXiv

Universal Automatic Phonetic Transcription into the International Phonetic Alphabet

Aug 07, 2023
Via arXiv

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

Mar 03, 2023
Via arXiv

Accelerating RNN-T Training and Inference Using CTC guidance

Oct 29, 2022
Via arXiv

Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification

Sep 13, 2022
Via arXiv

A Language Agnostic Multilingual Streaming On-Device ASR System

Aug 29, 2022
Via arXiv

Unsupervised Data Selection via Discrete Speech Representation for ASR

Apr 05, 2022
Via arXiv