Picture for Gene-Ping Yang

Gene-Ping Yang

A Simple HMM with Self-Supervised Representations for Phone Segmentation

Add code
Sep 15, 2024
Figure 1 for A Simple HMM with Self-Supervised Representations for Phone Segmentation
Figure 2 for A Simple HMM with Self-Supervised Representations for Phone Segmentation
Figure 3 for A Simple HMM with Self-Supervised Representations for Phone Segmentation
Figure 4 for A Simple HMM with Self-Supervised Representations for Phone Segmentation
Viaarxiv icon

Towards Matching Phones and Speech Representations

Add code
Oct 26, 2023
Viaarxiv icon

On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation

Add code
Jul 06, 2023
Viaarxiv icon

Supervised Attention in Sequence-to-Sequence Models for Speech Recognition

Add code
Apr 25, 2022
Figure 1 for Supervised Attention in Sequence-to-Sequence Models for Speech Recognition
Viaarxiv icon

Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation

Add code
Oct 29, 2020
Figure 1 for Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation
Figure 2 for Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation
Figure 3 for Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation
Figure 4 for Self-supervised Pre-training Reduces Label Permutation Instability of Speech Separation
Viaarxiv icon

Interrupted and cascaded permutation invariant training for speech separation

Add code
Oct 28, 2019
Figure 1 for Interrupted and cascaded permutation invariant training for speech separation
Figure 2 for Interrupted and cascaded permutation invariant training for speech separation
Figure 3 for Interrupted and cascaded permutation invariant training for speech separation
Figure 4 for Interrupted and cascaded permutation invariant training for speech separation
Viaarxiv icon

Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering

Add code
Apr 16, 2019
Figure 1 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Figure 2 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Figure 3 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Figure 4 for Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering
Viaarxiv icon