Picture for Xuan Shi

Xuan Shi

Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling

Add code
Sep 14, 2024
Figure 1 for Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling
Figure 2 for Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling
Figure 3 for Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling
Figure 4 for Egocentric Speaker Classification in Child-Adult Dyadic Interactions: From Sensing to Computational Modeling
Viaarxiv icon

Toward Fully-End-to-End Listened Speech Decoding from EEG Signals

Add code
Jun 12, 2024
Figure 1 for Toward Fully-End-to-End Listened Speech Decoding from EEG Signals
Figure 2 for Toward Fully-End-to-End Listened Speech Decoding from EEG Signals
Figure 3 for Toward Fully-End-to-End Listened Speech Decoding from EEG Signals
Figure 4 for Toward Fully-End-to-End Listened Speech Decoding from EEG Signals
Viaarxiv icon

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality

Add code
Apr 27, 2024
Viaarxiv icon

Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content

Add code
Jun 13, 2023
Viaarxiv icon

A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness

Add code
Dec 18, 2022
Viaarxiv icon

Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?

Add code
Nov 25, 2022
Figure 1 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 2 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 3 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 4 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Viaarxiv icon

Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms

Add code
Jul 24, 2021
Figure 1 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Figure 2 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Figure 3 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Figure 4 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Viaarxiv icon

RepGN:Object Detection with Relational Proposal Graph Network

Add code
Apr 18, 2019
Figure 1 for RepGN:Object Detection with Relational Proposal Graph Network
Figure 2 for RepGN:Object Detection with Relational Proposal Graph Network
Figure 3 for RepGN:Object Detection with Relational Proposal Graph Network
Figure 4 for RepGN:Object Detection with Relational Proposal Graph Network
Viaarxiv icon

End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking

Add code
Jan 02, 2019
Figure 1 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Figure 2 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Figure 3 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Figure 4 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Viaarxiv icon