Picture for Xiaorui Wang

Xiaorui Wang

Adaptive Integral Sliding Mode Control for Attitude Tracking of Underwater Robots With Large Range Pitch Variations in Confined Space

Add code
May 01, 2024
Viaarxiv icon

Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation

Add code
Apr 17, 2024
Viaarxiv icon

Filter Pruning via Filters Similarity in Consecutive Layers

Add code
Apr 26, 2023
Viaarxiv icon

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

Add code
Mar 14, 2023
Figure 1 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 2 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 3 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Figure 4 for Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
Viaarxiv icon

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

Add code
Dec 13, 2022
Figure 1 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 2 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 3 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Figure 4 for Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
Viaarxiv icon

Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

Add code
Nov 17, 2022
Viaarxiv icon

Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

Add code
Sep 17, 2022
Figure 1 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 2 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 3 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 4 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Viaarxiv icon

MELONS: generating melody with long-term structure using transformers and structure graph

Add code
Nov 03, 2021
Figure 1 for MELONS: generating melody with long-term structure using transformers and structure graph
Figure 2 for MELONS: generating melody with long-term structure using transformers and structure graph
Figure 3 for MELONS: generating melody with long-term structure using transformers and structure graph
Figure 4 for MELONS: generating melody with long-term structure using transformers and structure graph
Viaarxiv icon

SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification

Add code
Sep 18, 2021
Figure 1 for SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
Figure 2 for SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
Figure 3 for SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
Figure 4 for SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
Viaarxiv icon

Dynamic Multi-scale Convolution for Dialect Identification

Add code
Aug 02, 2021
Figure 1 for Dynamic Multi-scale Convolution for Dialect Identification
Figure 2 for Dynamic Multi-scale Convolution for Dialect Identification
Figure 3 for Dynamic Multi-scale Convolution for Dialect Identification
Figure 4 for Dynamic Multi-scale Convolution for Dialect Identification
Viaarxiv icon