Jinhan Wang

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Jan 26, 2024

Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech Signals

Jun 06, 2023

Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR

Apr 28, 2023

A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement

Jun 29, 2022

Unsupervised Instance Discriminative Learning for Depression Detection from Speech Signals

Jun 27, 2022

VADOI: Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition

Feb 22, 2022

FrAUG: A Frame Rate Based Data Augmentation Method for Depression Detection from Speech Signals

Feb 11, 2022

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System

Jun 18, 2021