Picture for Jiahong Yuan

Jiahong Yuan

Automated Tone Transcription and Clustering with Tone2Vec

Add code
Oct 03, 2024
Viaarxiv icon

Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models

Add code
Sep 25, 2024
Viaarxiv icon

Data-Driven Adaptive Simultaneous Machine Translation

Add code
Apr 27, 2022
Figure 1 for Data-Driven Adaptive Simultaneous Machine Translation
Figure 2 for Data-Driven Adaptive Simultaneous Machine Translation
Figure 3 for Data-Driven Adaptive Simultaneous Machine Translation
Figure 4 for Data-Driven Adaptive Simultaneous Machine Translation
Viaarxiv icon

Automatic recognition of suprasegmentals in speech

Add code
Aug 04, 2021
Figure 1 for Automatic recognition of suprasegmentals in speech
Figure 2 for Automatic recognition of suprasegmentals in speech
Figure 3 for Automatic recognition of suprasegmentals in speech
Figure 4 for Automatic recognition of suprasegmentals in speech
Viaarxiv icon

The Role of Phonetic Units in Speech Emotion Recognition

Add code
Aug 02, 2021
Figure 1 for The Role of Phonetic Units in Speech Emotion Recognition
Figure 2 for The Role of Phonetic Units in Speech Emotion Recognition
Figure 3 for The Role of Phonetic Units in Speech Emotion Recognition
Figure 4 for The Role of Phonetic Units in Speech Emotion Recognition
Viaarxiv icon

Decoupling recognition and transcription in Mandarin ASR

Add code
Aug 02, 2021
Figure 1 for Decoupling recognition and transcription in Mandarin ASR
Figure 2 for Decoupling recognition and transcription in Mandarin ASR
Figure 3 for Decoupling recognition and transcription in Mandarin ASR
Figure 4 for Decoupling recognition and transcription in Mandarin ASR
Viaarxiv icon

Text2Video: Text-driven Talking-head Video Synthesis with Phonetic Dictionary

Add code
Apr 29, 2021
Figure 1 for Text2Video: Text-driven Talking-head Video Synthesis with Phonetic Dictionary
Figure 2 for Text2Video: Text-driven Talking-head Video Synthesis with Phonetic Dictionary
Figure 3 for Text2Video: Text-driven Talking-head Video Synthesis with Phonetic Dictionary
Figure 4 for Text2Video: Text-driven Talking-head Video Synthesis with Phonetic Dictionary
Viaarxiv icon

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training

Add code
Oct 21, 2020
Figure 1 for Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training
Figure 2 for Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training
Figure 3 for Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training
Figure 4 for Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training
Viaarxiv icon

On the Role of Style in Parsing Speech with Neural Models

Add code
Oct 08, 2020
Figure 1 for On the Role of Style in Parsing Speech with Neural Models
Figure 2 for On the Role of Style in Parsing Speech with Neural Models
Figure 3 for On the Role of Style in Parsing Speech with Neural Models
Figure 4 for On the Role of Style in Parsing Speech with Neural Models
Viaarxiv icon