Jisi Zhang

Exploring compressibility of transformer based text-to-music (TTM) models

Jun 24, 2024

Locality enhanced dynamic biasing and sampling strategies for contextual ASR

Jan 23, 2024

Consistency Based Unsupervised Self-training For ASR Personalisation

Jan 22, 2024

On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer

Jul 25, 2023

On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training

May 03, 2022

Teacher-Student MixIT for Unsupervised and Semi-supervised Speech Separation

Jun 16, 2021

Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism

Feb 07, 2021

On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments

Nov 11, 2020