Picture for Rajiv Ratn Shah

Rajiv Ratn Shah

JOOCI: a Framework for Learning Comprehensive Speech Representations

Add code
Oct 14, 2024
Viaarxiv icon

RConE: Rough Cone Embedding for Multi-Hop Logical Query Answering on Multi-Modal Knowledge Graphs

Add code
Aug 21, 2024
Viaarxiv icon

Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation

Add code
Aug 20, 2024
Viaarxiv icon

Multilingual Non-Factoid Question Answering with Silver Answers

Add code
Aug 20, 2024
Viaarxiv icon

Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities

Add code
Jul 08, 2024
Figure 1 for Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities
Figure 2 for Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities
Figure 3 for Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities
Figure 4 for Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities
Viaarxiv icon

Keystroke Dynamics Against Academic Dishonesty in the Age of LLMs

Add code
Jun 21, 2024
Viaarxiv icon

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing

Add code
Jun 13, 2024
Viaarxiv icon

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech

Add code
Jun 12, 2024
Viaarxiv icon

MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations

Add code
Jun 09, 2024
Viaarxiv icon

LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs

Add code
May 02, 2024
Viaarxiv icon