Picture for Rajesh Sharma

Rajesh Sharma

Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution

Add code
Dec 23, 2024
Viaarxiv icon

ProvocationProbe: Instigating Hate Speech Dataset from Twitter

Add code
Oct 25, 2024
Viaarxiv icon

SeQuiFi: Mitigating Catastrophic Forgetting in Speech Emotion Recognition with Sequential Class-Finetuning

Add code
Oct 16, 2024
Figure 1 for SeQuiFi: Mitigating Catastrophic Forgetting in Speech Emotion Recognition with Sequential Class-Finetuning
Figure 2 for SeQuiFi: Mitigating Catastrophic Forgetting in Speech Emotion Recognition with Sequential Class-Finetuning
Viaarxiv icon

Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks

Add code
Oct 16, 2024
Figure 1 for Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks
Figure 2 for Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks
Figure 3 for Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks
Figure 4 for Multi-View Multi-Task Modeling with Speech Foundation Models for Speech Forensic Tasks
Viaarxiv icon

Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals

Add code
Oct 16, 2024
Figure 1 for Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals
Figure 2 for Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals
Figure 3 for Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals
Figure 4 for Beyond Speech and More: Investigating the Emergent Ability of Speech Foundation Models for Classifying Physiological Time-Series Signals
Viaarxiv icon

Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection

Add code
Sep 24, 2024
Figure 1 for Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection
Figure 2 for Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection
Figure 3 for Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection
Figure 4 for Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection
Viaarxiv icon

Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection

Add code
Sep 22, 2024
Figure 1 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection
Figure 2 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection
Figure 3 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection
Figure 4 for Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection
Viaarxiv icon

Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition

Add code
Sep 21, 2024
Figure 1 for Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition
Figure 2 for Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition
Figure 3 for Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition
Figure 4 for Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition
Viaarxiv icon

Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models

Add code
Sep 21, 2024
Figure 1 for Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models
Figure 2 for Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models
Figure 3 for Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models
Figure 4 for Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models
Viaarxiv icon

A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study

Add code
Sep 11, 2024
Figure 1 for A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study
Figure 2 for A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study
Figure 3 for A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study
Figure 4 for A Fine-grained Sentiment Analysis of App Reviews using Large Language Models: An Evaluation Study
Viaarxiv icon