Picture for Mahesh Kumar Nandwana

Mahesh Kumar Nandwana

SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers

Add code
Nov 15, 2024
Viaarxiv icon

Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation

Add code
Jun 14, 2024
Viaarxiv icon

Enhancing Multilingual Voice Toxicity Detection with Speech-Text Alignment

Add code
Jun 14, 2024
Figure 1 for Enhancing Multilingual Voice Toxicity Detection with Speech-Text Alignment
Figure 2 for Enhancing Multilingual Voice Toxicity Detection with Speech-Text Alignment
Figure 3 for Enhancing Multilingual Voice Toxicity Detection with Speech-Text Alignment
Figure 4 for Enhancing Multilingual Voice Toxicity Detection with Speech-Text Alignment
Viaarxiv icon