Mingxuan Wang

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

Dec 10, 2024
Via arXiv

SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion

Feb 20, 2024
Via arXiv

Speech Translation with Large Language Models: An Industrial Practice

Dec 21, 2023
Via arXiv

Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation

Sep 25, 2023
Via arXiv

BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training

Jul 10, 2023
Via arXiv

Recent Advances in Direct Speech-to-text Translation

Jun 20, 2023
Via arXiv

MOSPC: MOS Prediction Based on Pairwise Comparison

Jun 18, 2023
Via arXiv

Understanding Parameter Sharing in Transformers

Jun 15, 2023
Via arXiv

PolyVoice: Language Models for Speech to Speech Translation

Jun 13, 2023
Via arXiv

MobileNMT: Enabling Translation in 15MB and 30ms

Jun 07, 2023
Via arXiv