Yuanyuan Zhang

radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction

Oct 11, 2024

radarODE: An ODE-Embedded Deep Learning Model for Contactless ECG Reconstruction from Millimeter-Wave Radar

Aug 03, 2024

AI-Based Beam-Level and Cell-Level Mobility Management for High Speed Railway Communications

Jul 05, 2024

Improving child speech recognition with augmented child-like speech

Jun 12, 2024

Exploring data augmentation in bias mitigation against non-native-accented speech

Dec 24, 2023

Acoustic Model Fusion for End-to-end Speech Recognition

Oct 10, 2023

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers

May 23, 2023

Information Fusion in Attention Networks Using Adaptive and Multi-level Factorized Bilinear Pooling for Audio-visual Emotion Recognition

Nov 17, 2021

Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition

Dec 27, 2020

Frame-level SpecAugment for Deep Convolutional Neural Networks in Hybrid ASR Systems

Dec 07, 2020