Picture for Duc Le

Duc Le

Jack

Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens

Add code
Oct 04, 2024
Viaarxiv icon

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Add code
Sep 13, 2024
Figure 1 for Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Figure 2 for Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Figure 3 for Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Figure 4 for Seed-Music: A Unified Framework for High Quality and Controlled Music Generation
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding

Add code
Jun 12, 2024
Viaarxiv icon

Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

Add code
Dec 16, 2023
Viaarxiv icon

StemGen: A music generation model that listens

Add code
Dec 14, 2023
Viaarxiv icon

A Foundation Model for Music Informatics

Add code
Nov 06, 2023
Viaarxiv icon

Scaling Up Music Information Retrieval Training with Semi-Supervised Learning

Add code
Oct 02, 2023
Viaarxiv icon

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Add code
Jul 22, 2023
Viaarxiv icon

Text Generation with Speech Synthesis for ASR Data Augmentation

Add code
May 22, 2023
Viaarxiv icon