Picture for Siddique Latif

Siddique Latif

Sparks of Large Audio Models: A Survey and Outlook

Add code
Sep 03, 2023
Viaarxiv icon

Cross-Language Speech Emotion Recognition Using Multimodal Dual Attention Transformers

Add code
Jul 14, 2023
Viaarxiv icon

Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers

Add code
Jul 12, 2023
Viaarxiv icon

A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model

Add code
May 19, 2023
Viaarxiv icon

Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing

Add code
May 01, 2023
Viaarxiv icon

Lightweight Toxicity Detection in Spoken Language: A Transformer-based Approach for Edge Devices

Add code
Apr 22, 2023
Viaarxiv icon

MESAHA-Net: Multi-Encoders based Self-Adaptive Hard Attention Network with Maximum Intensity Projections for Lung Nodule Segmentation in CT Scan

Add code
Apr 04, 2023
Viaarxiv icon

Transformers in Speech Processing: A Survey

Add code
Mar 21, 2023
Viaarxiv icon

Generative Emotional AI for Speech Emotion Recognition: The Case for Synthetic Emotional Speech Augmentation

Add code
Jan 10, 2023
Viaarxiv icon

MEDS-Net: Self-Distilled Multi-Encoders Network with Bi-Direction Maximum Intensity projections for Lung Nodule Detection

Add code
Oct 30, 2022
Viaarxiv icon