Picture for Ding Ma

Ding Ma

Learning Emotion-discriminative Representations for Zero-Shot Cross-lingual Speech Emotion Recognition

Add code
Jun 04, 2026
Viaarxiv icon

Advancing Electrolaryngeal Speech Enhancement Through Speech-Text Representation Learning

Add code
Jun 01, 2026
Viaarxiv icon

Gaze into the Details: Locality-Sensitive Enhancement for OCTA Retinal Vessel Segmentation

Add code
May 20, 2026
Viaarxiv icon

Beyond Polarity: Multi-Dimensional LLM Sentiment Signals for WTI Crude Oil Futures Return Prediction

Add code
Mar 12, 2026
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Figure 1 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 2 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 3 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Figure 4 for Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Viaarxiv icon

Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions

Add code
Sep 29, 2024
Figure 1 for Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions
Figure 2 for Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions
Figure 3 for Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions
Figure 4 for Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions
Viaarxiv icon

Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders

Add code
Sep 18, 2023
Figure 1 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Figure 2 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Figure 3 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Figure 4 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Viaarxiv icon

Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

Add code
Nov 02, 2022
Figure 1 for Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition
Figure 2 for Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition
Figure 3 for Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition
Figure 4 for Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition
Viaarxiv icon

Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion

Add code
Oct 19, 2022
Figure 1 for Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion
Figure 2 for Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion
Figure 3 for Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion
Figure 4 for Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion
Viaarxiv icon

TCDCaps: Visual Tracking via Cascaded Dense Capsules

Add code
Feb 26, 2019
Figure 1 for TCDCaps: Visual Tracking via Cascaded Dense Capsules
Figure 2 for TCDCaps: Visual Tracking via Cascaded Dense Capsules
Figure 3 for TCDCaps: Visual Tracking via Cascaded Dense Capsules
Figure 4 for TCDCaps: Visual Tracking via Cascaded Dense Capsules
Viaarxiv icon