Picture for Vassilis Katsouros

Vassilis Katsouros

Meltemi: The first open Large Language Model for Greek

Add code
Jul 30, 2024
Viaarxiv icon

The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data

Add code
Jun 21, 2024
Viaarxiv icon

Weakly-supervised Automated Audio Captioning via text only training

Add code
Sep 21, 2023
Figure 1 for Weakly-supervised Automated Audio Captioning via text only training
Figure 2 for Weakly-supervised Automated Audio Captioning via text only training
Figure 3 for Weakly-supervised Automated Audio Captioning via text only training
Figure 4 for Weakly-supervised Automated Audio Captioning via text only training
Viaarxiv icon

Investigating Personalization Methods in Text to Music Generation

Add code
Sep 20, 2023
Figure 1 for Investigating Personalization Methods in Text to Music Generation
Figure 2 for Investigating Personalization Methods in Text to Music Generation
Figure 3 for Investigating Personalization Methods in Text to Music Generation
Figure 4 for Investigating Personalization Methods in Text to Music Generation
Viaarxiv icon

Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling

Add code
May 30, 2023
Viaarxiv icon

Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek

Add code
Dec 31, 2022
Viaarxiv icon

Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss

Add code
Apr 28, 2022
Figure 1 for Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss
Figure 2 for Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss
Figure 3 for Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss
Figure 4 for Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss
Viaarxiv icon

Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition

Add code
Apr 01, 2022
Figure 1 for Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition
Figure 2 for Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition
Figure 3 for Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition
Figure 4 for Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition
Viaarxiv icon