Picture for Takuya Higuchi

Takuya Higuchi

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Add code
Sep 16, 2024
Viaarxiv icon

Towards Automatic Assessment of Self-Supervised Speech Models using Rank

Add code
Sep 16, 2024
Viaarxiv icon

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Add code
Sep 16, 2024
Viaarxiv icon

Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations

Add code
Apr 05, 2024
Viaarxiv icon

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?

Add code
Feb 01, 2024
Viaarxiv icon

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Add code
Jan 30, 2024
Viaarxiv icon

Multichannel Voice Trigger Detection Based on Transform-average-concatenate

Add code
Sep 27, 2023
Viaarxiv icon

Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study

Add code
Sep 27, 2023
Viaarxiv icon

Improving Voice Trigger Detection with Metric Learning

Add code
Apr 05, 2022
Figure 1 for Improving Voice Trigger Detection with Metric Learning
Figure 2 for Improving Voice Trigger Detection with Metric Learning
Figure 3 for Improving Voice Trigger Detection with Metric Learning
Figure 4 for Improving Voice Trigger Detection with Metric Learning
Viaarxiv icon

Multi-task Learning with Cross Attention for Keyword Spotting

Add code
Jul 15, 2021
Figure 1 for Multi-task Learning with Cross Attention for Keyword Spotting
Figure 2 for Multi-task Learning with Cross Attention for Keyword Spotting
Figure 3 for Multi-task Learning with Cross Attention for Keyword Spotting
Figure 4 for Multi-task Learning with Cross Attention for Keyword Spotting
Viaarxiv icon