Picture for Sakriani Sakti

Sakriani Sakti

A Transformer Framework for Simultaneous Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Add code
Nov 06, 2024
Viaarxiv icon

A Neural Transformer Framework for Simultaneous Tasks of Segmentation, Classification, and Caller Identification of Marmoset Vocalization

Add code
Oct 30, 2024
Viaarxiv icon

Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities

Add code
Oct 11, 2024
Viaarxiv icon

Contrastive Feedback Mechanism for Simultaneous Speech Translation

Add code
Jul 31, 2024
Viaarxiv icon

On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition

Add code
Jul 31, 2024
Viaarxiv icon

NAIST Simultaneous Speech Translation System for IWSLT 2024

Add code
Jun 30, 2024
Viaarxiv icon

SpeeChain: A Speech Toolkit for Large-Scale Machine Speech Chain

Add code
Jan 08, 2023
Viaarxiv icon

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Add code
Dec 20, 2022
Viaarxiv icon

Instance-level Heterogeneous Domain Adaptation for Limited-labeled Sketch-to-Photo Retrieval

Add code
Dec 06, 2022
Viaarxiv icon

Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos

Add code
Aug 27, 2022
Figure 1 for Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Figure 2 for Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Figure 3 for Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Figure 4 for Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Viaarxiv icon