Picture for Saurabhchand Bhati

Saurabhchand Bhati

State-Space Large Audio Language Models

Add code
Nov 24, 2024
Viaarxiv icon

DASS: Distilled Audio State Space Models Are Stronger and More Duration-Scalable Learners

Add code
Jul 04, 2024
Viaarxiv icon

Audio-Visual Neural Syntax Acquisition

Add code
Oct 11, 2023
Viaarxiv icon

Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

Add code
Sep 08, 2023
Figure 1 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 2 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 3 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Figure 4 for Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning
Viaarxiv icon

Regularizing Contrastive Predictive Coding for Speech Applications

Add code
Apr 26, 2023
Figure 1 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 2 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 3 for Regularizing Contrastive Predictive Coding for Speech Applications
Figure 4 for Regularizing Contrastive Predictive Coding for Speech Applications
Viaarxiv icon

Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

Add code
Jan 28, 2022
Figure 1 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 2 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 3 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 4 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Viaarxiv icon

Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding

Add code
Oct 08, 2021
Figure 1 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Figure 2 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Figure 3 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Figure 4 for Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding
Viaarxiv icon

Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation

Add code
Jun 03, 2021
Figure 1 for Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Figure 2 for Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Figure 3 for Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Figure 4 for Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation
Viaarxiv icon

Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery

Add code
Jul 26, 2020
Figure 1 for Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Figure 2 for Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Figure 3 for Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Viaarxiv icon