Xiaohai Tian

SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation

Nov 27, 2024
Via arXiv

Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation

Sep 25, 2024
Via arXiv

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

Jun 19, 2024
Via arXiv

CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing

Jan 22, 2024
Via arXiv

Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring

May 19, 2023
Via arXiv

Leveraging phone-level linguistic-acoustic similarity for utterance-level pronunciation scoring

Mar 13, 2023
Via arXiv

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

Mar 13, 2023
Via arXiv

TTS-Guided Training for Accent Conversion Without Parallel Data

Dec 20, 2022
Via arXiv

Improving Non-native Word-level Pronunciation Scoring with Phone-level Mixup Data Augmentation and Multi-source Information

Mar 01, 2022
Via arXiv

The Multi-speaker Multi-style Voice Cloning Challenge 2021

Apr 05, 2021
Via arXiv