Picture for Xin Jing

Xin Jing

MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge

Add code
Jan 08, 2025
Figure 1 for MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge
Figure 2 for MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge
Figure 3 for MAD-UV: The 1st INTERSPEECH Mice Autism Detection via Ultrasound Vocalization Challenge
Viaarxiv icon

Audio-based Kinship Verification Using Age Domain Conversion

Add code
Oct 14, 2024
Viaarxiv icon

On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction

Add code
Sep 25, 2024
Figure 1 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Figure 2 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Figure 3 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Figure 4 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Viaarxiv icon

CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models

Add code
Sep 25, 2024
Figure 1 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Figure 2 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Figure 3 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Figure 4 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Viaarxiv icon

Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models

Add code
Sep 10, 2024
Viaarxiv icon

ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks

Add code
Jun 11, 2024
Viaarxiv icon

DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition

Add code
Jun 11, 2024
Figure 1 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Figure 2 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Figure 3 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Figure 4 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Viaarxiv icon

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition

Add code
Feb 02, 2024
Figure 1 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Figure 2 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Figure 3 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Figure 4 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Viaarxiv icon

U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech

Add code
May 22, 2023
Viaarxiv icon

HEAR4Health: A blueprint for making computer audition a staple of modern healthcare

Add code
Jan 25, 2023
Viaarxiv icon