Picture for Guanglu Wan

Guanglu Wan

AS-Speech: Adaptive Style For Speech Synthesis

Add code
Sep 09, 2024
Viaarxiv icon

MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research

Add code
Jun 26, 2024
Figure 1 for MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research
Figure 2 for MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research
Figure 3 for MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research
Figure 4 for MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research
Viaarxiv icon

CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs

Add code
May 27, 2024
Viaarxiv icon

Learning or Self-aligning? Rethinking Instruction Fine-tuning

Add code
Mar 02, 2024
Viaarxiv icon

A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-training

Add code
Oct 01, 2023
Viaarxiv icon

CPPF: A contextual and post-processing-free model for automatic speech recognition

Add code
Sep 21, 2023
Figure 1 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 2 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Figure 3 for CPPF: A contextual and post-processing-free model for automatic speech recognition
Viaarxiv icon

Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter

Add code
Sep 19, 2023
Figure 1 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 2 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 3 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Figure 4 for Enhancing Multilingual Speech Recognition through Language Prompt Tuning and Frame-Level Language Adapter
Viaarxiv icon

Exploiting Pseudo Future Contexts for Emotion Recognition in Conversations

Add code
Jun 27, 2023
Viaarxiv icon

Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation

Add code
Apr 03, 2023
Figure 1 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 2 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 3 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Figure 4 for Dialog-to-Actions: Building Task-Oriented Dialogue System via Action-Level Generation
Viaarxiv icon

Covariance Regularization for Probabilistic Linear Discriminant Analysis

Add code
Dec 06, 2022
Viaarxiv icon