Picture for Xin Jing

Xin Jing

Audio-based Kinship Verification Using Age Domain Conversion

Add code
Oct 14, 2024
Viaarxiv icon

CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models

Add code
Sep 25, 2024
Figure 1 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Figure 2 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Figure 3 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Figure 4 for CasFT: Future Trend Modeling for Information Popularity Prediction with Dynamic Cues-Driven Diffusion Models
Viaarxiv icon

On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction

Add code
Sep 25, 2024
Figure 1 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Figure 2 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Figure 3 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Figure 4 for On Your Mark, Get Set, Predict! Modeling Continuous-Time Dynamics of Cascades for Information Popularity Prediction
Viaarxiv icon

Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models

Add code
Sep 10, 2024
Viaarxiv icon

ParaCLAP -- Towards a general language-audio model for computational paralinguistic tasks

Add code
Jun 11, 2024
Viaarxiv icon

DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition

Add code
Jun 11, 2024
Figure 1 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Figure 2 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Figure 3 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Figure 4 for DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition
Viaarxiv icon

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition

Add code
Feb 02, 2024
Figure 1 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Figure 2 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Figure 3 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Figure 4 for STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Viaarxiv icon

U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech

Add code
May 22, 2023
Viaarxiv icon

HEAR4Health: A blueprint for making computer audition a staple of modern healthcare

Add code
Jan 25, 2023
Viaarxiv icon

Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression

Add code
Jun 28, 2022
Figure 1 for Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression
Figure 2 for Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression
Figure 3 for Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression
Viaarxiv icon