Picture for Xiaorui Wang

Xiaorui Wang

A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions

Add code
Dec 12, 2024
Viaarxiv icon

Learning and Current Prediction of PMSM Drive via Differential Neural Networks

Add code
Dec 12, 2024
Viaarxiv icon

Adaptive Integral Sliding Mode Control for Attitude Tracking of Underwater Robots With Large Range Pitch Variations in Confined Space

Add code
May 01, 2024
Viaarxiv icon

Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation

Add code
Apr 17, 2024
Viaarxiv icon

Filter Pruning via Filters Similarity in Consecutive Layers

Add code
Apr 26, 2023
Viaarxiv icon

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

Add code
Mar 14, 2023
Figure 1 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 2 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 3 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 4 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Viaarxiv icon

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

Add code
Dec 13, 2022
Figure 1 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 2 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 3 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 4 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Viaarxiv icon

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

Add code
Nov 17, 2022
Viaarxiv icon

Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

Add code
Sep 17, 2022
Figure 1 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 2 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 3 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 4 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Viaarxiv icon

MELONS: generating melody with long-term structure using transformers and structure graph

Add code
Nov 03, 2021
Figure 1 for MELONS: generating melody with long-term structure using transformers and structure graph
Figure 2 for MELONS: generating melody with long-term structure using transformers and structure graph
Figure 3 for MELONS: generating melody with long-term structure using transformers and structure graph
Figure 4 for MELONS: generating melody with long-term structure using transformers and structure graph
Viaarxiv icon