Saksham Singh Kushwaha

VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation

Dec 14, 2024

Diff-SAGe: End-to-End Spatial Audio Generation Using Diffusion Models

Oct 15, 2024

Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions

Sep 17, 2023

A Multimodal Prototypical Approach for Unsupervised Sound Classification

Jun 21, 2023