Picture for Sarthak Yadav

Sarthak Yadav

Audio xLSTMs: Learning Self-Supervised Audio Representations with xLSTMs

Add code
Sep 02, 2024
Viaarxiv icon

Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations

Add code
Jun 04, 2024
Viaarxiv icon

Masked Autoencoders with Multi-Window Attention Are Better Audio Learners

Add code
Jun 01, 2023
Figure 1 for Masked Autoencoders with Multi-Window Attention Are Better Audio Learners
Figure 2 for Masked Autoencoders with Multi-Window Attention Are Better Audio Learners
Figure 3 for Masked Autoencoders with Multi-Window Attention Are Better Audio Learners
Figure 4 for Masked Autoencoders with Multi-Window Attention Are Better Audio Learners
Viaarxiv icon

Learning neural audio features without supervision

Add code
Mar 29, 2022
Figure 1 for Learning neural audio features without supervision
Figure 2 for Learning neural audio features without supervision
Figure 3 for Learning neural audio features without supervision
Figure 4 for Learning neural audio features without supervision
Viaarxiv icon

GISE-51: A scalable isolated sound events dataset

Add code
Mar 23, 2021
Figure 1 for GISE-51: A scalable isolated sound events dataset
Figure 2 for GISE-51: A scalable isolated sound events dataset
Figure 3 for GISE-51: A scalable isolated sound events dataset
Figure 4 for GISE-51: A scalable isolated sound events dataset
Viaarxiv icon

Frequency and temporal convolutional attention for text-independent speaker recognition

Add code
Oct 19, 2019
Figure 1 for Frequency and temporal convolutional attention for text-independent speaker recognition
Figure 2 for Frequency and temporal convolutional attention for text-independent speaker recognition
Figure 3 for Frequency and temporal convolutional attention for text-independent speaker recognition
Figure 4 for Frequency and temporal convolutional attention for text-independent speaker recognition
Viaarxiv icon