Picture for Guoqing Zhao

Guoqing Zhao

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

Add code
Mar 19, 2024
Viaarxiv icon

SELM: Speech Enhancement Using Discrete Tokens and Language Models

Add code
Dec 15, 2023
Figure 1 for SELM: Speech Enhancement Using Discrete Tokens and Language Models
Figure 2 for SELM: Speech Enhancement Using Discrete Tokens and Language Models
Figure 3 for SELM: Speech Enhancement Using Discrete Tokens and Language Models
Figure 4 for SELM: Speech Enhancement Using Discrete Tokens and Language Models
Viaarxiv icon

Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning

Add code
Oct 26, 2023
Figure 1 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 2 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 3 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Figure 4 for Multi-Speaker Expressive Speech Synthesis via Semi-supervised Contrastive Learning
Viaarxiv icon

Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification

Add code
Oct 07, 2023
Figure 1 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Figure 2 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Figure 3 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Figure 4 for Multi-objective Progressive Clustering for Semi-supervised Domain Adaptation in Speaker Verification
Viaarxiv icon

Haha-Pod: An Attempt for Laughter-based Non-Verbal Speaker Verification

Add code
Sep 25, 2023
Viaarxiv icon

VoxBlink: X-Large Speaker Verification Dataset on Camera

Add code
Aug 23, 2023
Viaarxiv icon

The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023

Add code
Aug 17, 2023
Viaarxiv icon

The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023

Add code
Aug 17, 2023
Figure 1 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 2 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 3 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Figure 4 for The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 2023
Viaarxiv icon

The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task

Add code
Jul 10, 2023
Figure 1 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Figure 2 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Figure 3 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Figure 4 for The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task
Viaarxiv icon

TreeMAN: Tree-enhanced Multimodal Attention Network for ICD Coding

Add code
May 29, 2023
Viaarxiv icon