Picture for Ramani Duraiswami

Ramani Duraiswami

3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering

Add code
Jan 14, 2025
Viaarxiv icon

TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification

Add code
Dec 31, 2024
Viaarxiv icon

Applying Automatic Differentiation to Optimize Differential Microphone Array Designs

Add code
Dec 06, 2024
Viaarxiv icon

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Add code
Oct 24, 2024
Figure 1 for MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Figure 2 for MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Figure 3 for MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Figure 4 for MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Viaarxiv icon

Biomimetic Frontend for Differentiable Audio Processing

Add code
Sep 13, 2024
Viaarxiv icon

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Add code
Sep 13, 2024
Figure 1 for ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Figure 2 for ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Figure 3 for ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Figure 4 for ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
Viaarxiv icon

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Add code
Jun 17, 2024
Figure 1 for GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Figure 2 for GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Figure 3 for GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Figure 4 for GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
Viaarxiv icon

LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition

Add code
Jun 06, 2024
Figure 1 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Figure 2 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Figure 3 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Figure 4 for LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
Viaarxiv icon

FAST: Factorizable Attention for Speeding up Transformers

Add code
Feb 12, 2024
Viaarxiv icon

A Closer Look at the Limitations of Instruction Tuning

Add code
Feb 03, 2024
Viaarxiv icon